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MAMMALIAN GENES: DENDRITIC CELL PROSTAGLAND IN-LIKE TRANSPONDER (DC-PGT). HDTEA84. 
HSUD37R AND RANKL. HCC5 CHEMOKINE. DEUBIQUmNATING II AND 12 (DUB1 1. DUBI2X MD-1. MD2 AND 
CYCLIN E2. RELATED REAGENTS AND METHODS 

FIELD OF THE INVENTION 



5 The present invention pertains to compositions related to 

proteins which: function in cellular physiology, development, and 
differentiation of mammalian cells; exhibit sequence similarity to 
15 TNF receptors which function in controlling activation and 

expansion of mammalian cells,, e.g., cells of a mammalian immune 
10 system; or function in controlling the cell cycle and growth. In 
particular, it provides purified genes, proteins, antibodies, and 
related reagents useful, e.g., to separate or identify particular 
cell types, or to regulate activation, development, 
differentiation, and function o: various cell types, including 
IS hematopoietic cells,- which exhibit high structural similarity to 
proteins that exhibit the biological capacity to serve as a 
carrier mediated transporters of charged organic anions across 
cellular membranes, which typically can be used in prostaglandin 
and thromboxane physiology, e.g., transportation, influx, efflux, 
20 clearance, or degradation; which regulate or evidence development, 
differentiation, and function of various cell types, including 
hematopoietic cells; or to regulate ceil division and 
proliferation of various cell types, including tumor cells. 

35 25 BACKGROUND OF THE INVENTION 

Prostaglandins (PGs) and thromboxanes (TXs) play widespread 
physiological, and therapeutic roles in health and disease such as 
40 glaucoma; pregnancy, labor, delivery, and abortion; gastric 

30 protection and peptic ulcer formation; intestinal fluid secretion; 
liver protection and damage; airway resistance and asthma; blood 
pressure control; and modulation of inf lanmatory cells. 
45 PGs are charged anions at physiological pH that diffuse 

poorly across biologdcal membranes. This limited simple diffusion 
35 appears to be augmented by carrier mediated transport in many 

diverse tissues such as the lung, choroid plexus, liver, anterior 
So chamber of the eye, vagina, uterus, and placenta. 
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Understanding the role of prostaglandins in the development 
and functioning of the immune system is presently incomplete 
Specifically, the influence of prostaglandins (PCs) on antigen 
presenting cells (APCs) of the immune system (e.g.. dendritic 
cells) is, as yet, poorly understood. 

Dendritic cells (DCs) are the most potent of antigen 
presenting cells. See, e.g., Paul (ed. 1993) Fundamental 
Immunplpoy 3d ed. , Raven Press. NY. DCs are highly responsive to 
inflammatory stimuli such as bacterial lipopolysaccharides (LPS) 
and cytokines such as tumor necrosis factor alpha (TNFa) . The 
presence of cytokines and LPS can induce a series of phenotypic 
and functional changes in DC that are collectively referred to as 
maturation. See, e.g.. Banchereau and Schmit- Dendritic PpHc ^ 
Fundamental and Clinical Tmnm™ ] ^ , plenum Press, NY. 

Maturational changes in DCs include, e.g., silencing of 
antigen uptake by endocytosis, upregulation of surface molecules 
related to T cell activation, and active production of a number of 
cytokines including TNFa and IL-12 . Upon local accumulation of 
TNFa, DCs migrate to the T cell areas of secondary lymphoid organs 
to activate antigen specific T cells. 

Recent data indicate that DCs secrete PCs. See, e.g., 
Cormann. et al. (1986) Ann. Tn«t- p»^„. 137:369-382. 
Furthermore, PGE 2 has been shown to have an influence on DC 
maturity and the production of cytokines by DCs. Seem e.g., 
Kalinski, et al. (1997) J. Immunol. 159:28-35; Kuhn, et al.'(1997) 
Eur. J. Immunol 27:3135-3142; and Rieser, et al . (1997) J. Exn. 
Med. 186:1603-1608. 

Currently, a need exists to understand the manner in which 
PGs influence cells of the immune system. It seems likely that 
PGs, like cytokines, effect immune system development and 
activation. The present invention contributes to satisfying that 
need and is directed generally to a novel mammalian gene encoding 
a prostaglandin-like transporter (PGT) . 

In other aspects, the activation of resting T cells is 
critical to most immune responses and allows these cells to exert 
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their regulatory or effector capabilities. See. e.g., Pa ul (ed 
1993 ) Fundament ^ T Win olo ro 3d ed., Raven Press, N.Y. Increased 
adhesion between T cells and antigen presenting cells (A PC) or 
other forms of primary stimuli, e.g., Mobilized monoclonal 
5 antibodies (mAb) , can potentiate the T-cell receptor signals T 
cell activation and T cell expansion depends upon engagement of 
the T-cell receptor (TCR) and co-stimulatory signals provided by 
accessory cells. See, e.g., Jenkins and Johnson (1993, C urr 

Tmm " n01 5:361 " 367 <- Bierer and Hahn (1993) Semin. t 1 

10 5:249-261; June, et al. (1990) Immunol. TWUy li :211 _ 216 . ^ 

Jenkins (1994) Ir^unity. 1:443-446. A major, and well-studied, co- 
stimulatory interaction for T cells involves either CD28 or CTLA-4 
on T cells with either B7 or B 70 (Jenkins (1994, Immunity 1:443- 
446) . Recent studies on CD28 deficient mice (Shahinian et al 
(1993, Science 261:609-612; Green, et al . (1994) Immunity 1 :50 1- 
508) and CTLA-4 immunoglobulin expressing transgenic mice 
(Ronchese. et al. (1994) J, Sxp . MP d . 179:809-817) have revealed 
deficiencies in some T-cell responses though these mice have 
normal primary immune responses and normal CTL responses to 
lymphocytic choriomeningitis virus and vesicular stomatitis virus 
As a result, both these studies conclude that other co-stimulatory 
molecules must be supporting T-cell function. However, 
identification of these molecules which mediate distinct 
costimulatory signals has been difficult. 

Tumor Necrosis Factor (TNF) is the prototypic member of an 
emerging family of cytokines that function as prominent mediators 
of immune regulation and the inflammatory response. These ligands 
are typically type II membrane proteins, with homology at the 
carboxy terminus. A proteolytic processed soluble protein often 
30 is produced. See, e.g.. Smith, et al. (1994) Cell 76-959-962; 
45 Armitage (1994) Current Opinion in Tmrnn polo gx 6:407-413; Gruss and 

Dower (1995) Blood 85:3378-3404; Wiley, et al. (1995) immunity 
3:673-682; and Baker and Reddy (1996) Oncogene 12:1-9. Crucial 
roles for these family members are evidenced by a number of 
studies, and they are implicated in regulation of apoptosis. 
peripheral tolerance, Ig maturation and isotype switching, and 
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general B cell and T cell functions. See, e.g., Thomson (ed. 
1994) The Cytokine Handbook Academic Press, San Diego, CA; 
Naismith and Sprang (1998) Trends Biochera. Sci. 23:74-79; Lucas, 
et al. (1997) J. Leukoc. Biol. 61:551-558; Reddi (1997) Cell 
89:159-161; Van Deventer (1997) Gut 40:443-448; Jablonska (1997) 
Postepy. Hig. Med. Dosw. 51:567-575; Hill and Lunec (1996) Mol. 
Aspects Med- 17:455-509; Aderka (1996) Cytokine Growth Factor Rev. 
7:231-240; Lotz, et al . (1996) J. Leukoc. Biol. 60:1-7; and Gruss 
and Dower (1995) Cytokines Mol. Ther. 1:75-105. These imply 
fundamental roles in immune and developmental networks relevant to 
human therapeutic needs. The identification of ligands and cell 
surface receptors allow determination of pairs, which will be 
useful in modulating such signal transduction. 

The discovery of new cell markers is always potentially 
useful. Moreover, the inability to modulate activation signals 
prevents control of inappropriate developmental or physiological 
responses in the immune system. The present invention provides at 
least one alternative costimulatory molecule, which will be useful 
as a marker for cell types, and agonists and antagonists of which 
will be useful in modulating a plethora of immune conditions or 
responses . 

The circulating component of the mammalian circulatory system 
comprises various cell types, including red and white blood cells 
of the erythroid and myeloid cell lineages. See, e.g., Rapaport 
(1987) Introduction to Hematology (2d ed. ) Lippincott, 
Philadelphia, PA; Jandl (1987) Blood: Textbook of Hematology . 
Little, Brown and Co., Boston, MA.; and Paul (ed. 1993) 
Fundamental Immunology (3d ed.) Raven Press, N.Y. 

For some time, it has been known that the mammalian immune 
response is based on a series of complex cellular interactions, 
called the " immune network . " Recent research has provided new 
insights into the inner workings of this network. While it 
remains clear that much of the response does, in fact, revolve 
around the network-like interactions of lymphocytes, macrophages, 
granulocytes, and other cells, immunologists now generally hold 
the opinion that soluble proteins, known as lyraphokines, 
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cytokines, or monokines, play a critical role in controlling these 
cellular interactions. Thus, there is considerable interest in 
the isolation, characterization, and mechanisms of action of cell 
modulatory factors, an understanding of which should lead to 
significant advancements in the diagnosis and therapy of numerous 
medical abnormalities, e.g., immune system and other disorders. 

Lymphokines apparently mediate cellular activities in a 
variety of ways. They have been shown to support the 
proliferation, growth, and differentiation of the pluripotential 
hematopoietic stem cells into vast numbers of progenitors 
comprising diverse cellular lineages making up a complex immune 
system. These interactions between the cellular components are 
necessary for a healthy immune response. These different cellular 
lineages often respond in a different manner when lymphokines are 
administered in conjunction with other agents. 

The chemokines are a large and diverse superfamily of 
proteins. The superfamily is subdivided into two classical 
branches, based upon whether the first two cysteines in the 
chemokine motif are adjacent (termed the "C-C" branch), or spaced 
by an intervening residue ("C-X-C") . A more recently identified 
branch of chemokines lacks two cysteines in the corresponding 
motif, and is represented by the chemokines known as 
lymphotactins . Another recently identified branch has three 
intervening residues between the two cysteines, e.g., CX3C 
chemokines. See, e.g., Schall and Bacon (1994) Current Opinion in 
immunology 6:865-873; and Bacon and Schall (1996) Int, Arch. 
Allergy & Immunol. 109:97-109. 

Because the physiology mediated by these soluble molecules is 
so important, the discovery of novel chemokines will be important, 
both in diagnostic and therapeutic contexts. 

In addition, while the general importance of the regulation 
of protein synthesis is universally accepted, the general 
importance of protein degradation has not been fully appreciated. 
One mechanism of protein degradation is via ubiquitination signals 
and degradation pathways. Ubiguitin (Ub) is a highly conserved 76 
amino acid polypeptide that plays an important role in the 
regulation of protein degradation, cell-cycle progression, gene 
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transcription and signal transduction. The ubiquitination pathway 
is fine tuned and controlled, in part, by deubiquitination 
enzymes, which remove ubiquitin from proteins. Misregulation of 
the ubiquitination pathway may contribute problems in the protein 
quantity regulation, which may be associated, e.g., with malignant 
transformation, and oncogenesis through oncogenic counterparts of 
normally processed ubiquitinated proteins. Other clinical 
problems will often result from excessive or insufficient protein 
levels. Therefore, understanding the ubiquitination roles, e.g., 
in immune function, will increase our understanding of cell 
biology, which should have relevance, e.g., to malignant 
transformation. 

Furthermore, growth of normal resting B cells (also referred 
to as "B lymphocytes") involves two distinct steps. First, the 
resting cells are activated to pass from the Gg to Gj phase of the 
cell cycle. See, e.g., Alberts, et al. (eds. 1989) Molecular 
Biology of the Cell Garland Publ., NY; and Darnell, et al. (1990) 
Molecular Cell Biology Freeman, NY. Next, the activated cells are 
induced to proliferate. See, e.g., Paul, ed. (1989) Fundamental 
Immunology . 2nd ed. , Raven Press, NY; and the third edition. 
Several factors have been identified that induce growth of B 
cells, including interleukin-1 (IL-1) , IL-2, IL-4, IL-10, and IL- 
13. In addition, antibodies against certain B cell surface 
molecules have been demonstrated to promote B cell proliferation. 
T cells (also referred to as "T lymphocytes") are also induced to 
proliferate by certain factors, which include phytohemagglutinin, 
anti-T cell receptor monoclonal antibodies, anti-CD3 monoclonal 
antibodies, and other agents. 

B7 (CD80) and B70 (CD86) are the second "group" of molecules 
which strongly mediate B and T cell interaction. These molecules, 
on B cells, interact with their ligands CD28 and CTLA-4 on T 
cells. These interactions are major co-stimulatory signals for 
activation of both B and T cells . 

During the last 15 years, it has become apparent that B7 
(CD80) and B70 (CD86) play fundamental functions in T cell and B 
cell activation Numerous in vitro and in vivo experiments have 
demonstrated that these two pairs of molecules represent important 
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targets for immunosuppression. See, e.g., Banchereau, et al. 
(1994) Ann. Rev. Immunol. 12:881-922; van Kooten. et al. (1996) 
Adv. Immunol. 61:1-77; Linsley and Ledbetter (1993) Ann. Rev. 
Immunol. 11:191-212) . 

In 1995, another molecule called RP105 was cloned from mouse 
splenic cells. See Miyake, et al (1995) J . Immunol . 154:3333- 
334C. Monoclonal antibodies against RP105 also induce strong 
proliferation of mouse B cells and protects mouse B cells from 
irradiation-induced apoptosis in a similar fashion to anti-CD40 
antibody or CD40-ligand. See Miyake, et al . (1994) J. Exp. Med. 
180:1217-1224. 

The RP105 molecule and its ligand MD-1 may be an additional 
pair of molecules that play key roles in the activation of T cells 
and B cells. See Miyake, et al. (1998) J. Immunol. 161:1349-1353; 
and Chan, et al., (1S98) J. Exp. Med. 188:93-101 However, the 
human sequence of MD-1, has remained undetermined. The present 
invention provides this and also provides a previously undescribed 
second human homolog of mouse MD-1, (i.e., MD-2) . 

Many factors have been identified which influence the 
differentiation process of precursor cells, or regulate the 
physiology or migration properties of specific cell types. These 
observations indicate that other factors exist whose functions in 
immune function were heretofore unrecognized. These factors 
provide for biological activities whose spectra of effects may be 
distinct from known differentiation or activation factors. The 
absence of knowledge about the structural, biological, and 
physiological properties of the regulatory factors which regulate 
cell physiology in vivo prevents the modulation of the effects of 
such factors. Thus, medical conditions where regulation of the 
development or physiology of relevant cells is required remains 
unmanageable . 

Thus, significant therapeutic needs exist in the areas of 
cytokine regulation of physiology, protein degradation, and B cell 
signaling. The present invention provides important insights and 
developments in these areas . 

Cancer can occur in many tissues of the body. It results 
from a change in certain ceils that causes them to evade the 
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normal growth limiting mechanisms, e.g., to escape the feedback 
controls that normally stop cellular growth and reproduction after 
a given number of such cells have developed. Cell division and 
transcription are highly coordinated processes that play important 
roles in this feedback control. See, e.g.. Beeson, et al. (eds. 
1979) Textbook of Medicine . 15th ed., W.B. Saunders Co., 
Philadelphia, PA. ; DeVita, et al. (eds. 1997) Cancer: Principle 
and Practice of Oncology. 5th ed. , Lippincott, Philadelphia, PA; 
Neal and Hoskin (1997) Clinical Oncology: Basic Principles and 
Practice Oxford University Press, NY; Kastan (1997) Checkpoint 
Controls and Cancer CSH Press, NY; and Thomas (ed. 1996) Apoptosls 
and Cell Cycle Control in Cancer: Basic Mechanisms and 
Implications f or Treating Malignant Disease BIOS Scientific, 
Oxford. 

Molecules which function to regulate cell division play 
important roles in the controlled growth of various types of 
cells. Aberrations in these controls can lead to various disease 
states, e.g., oncogenesis, improper wound healing, developmental 
abnormalities, and metabolic problems. 

The cell cycle can be divided into four phases: the 
presynthetic phases (Go and Gi) ; the phase of DNA synthesis (S) ; 
and the postsynthetic phase (G2). See, e.g., Guyton (ed. 1976) 
Textbook of Medical Physiology . 5th ed. , W.B. Saunders Co., 
Philadelphia, PA.; Alberts, et al. (eds. 1994) Molecular Biology 
of the Cell. 3rd ed. , Garland Publishing, New York, NY; and 
Darnell, et al . (eds. 1990) Molecular Cell Biology . 2nd ed. , W.H. 
Freeman, New York, NY. Effective chemotherapeutic agents are 
often those which target diseased cells in the S phase, e.g., 
choriocarcinoma, acute lymphocytic leukemia, lyphocytic 
lymphosarcoma, Burkitfs lymphoma, Hodgkin's disease, testicular 
neoplasms, Wilm's tumor, and Ewing's sarcoma. Unfortunately, 
oncogenic cells not actively dividing are less sensitive to these 
agents . 

The lack of knowledge regarding the control of the cell cycle 
has hampered the ability of medical science to specifically 
regulate cell division or immune responses. The present invention 
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provides compositions which will be important in the control of 
cell division and transcription. 

SUMMARY OF THE INVENTION 

The present invention is based, in part, upon the 
characterization of the genes and products relating to the DC-PGT, 
HDTEA84, HSLJD37R, RANKL, HCC5 chemokine, Dubll, Dubl2, MD-1, MD- 
2, and cyclin E2 . It provides . nucleic acids, polypeptides, 
antibodies, and methods for making and using such compositions. 

In the DC-PGT embodiments, the invention provides an isolated 
or recombinant antigenic polypeptide comprising: a plurality of 
distinct segments, wherein each segment has identity to at least 
12 contiguous amino acids from the mature SEQ ID NO: 2; or at 
least 17 contiguous amino acids from the mature SEQ ID NO: 2. In 
certain embodiments, the plurality of segments includes one of at 
least 19 contiguous amino acids; or two of at least 15 contiguous 
amino acids. Other polypeptides include those wherein the 
polypeptide: comprises the mature SEQ ID NO: 2; binds with 
specificity to a polyclonal antibody which specifically binds to 
SEQ ID NO: 2; or the polypeptide: is a natural allelic variant of 
SEQ ID NO: 2; is at least 30 amino acids in length; exhibits at 
least two non-overlapping epitopes specific for SEQ ID NO: 2; is a 
synthetic polypeptide; is attached to a solid substrate; or is a 
5-fold or less conservative substitution from SEQ ID NO: 2. 
Fusion polypeptides are also provided, e.g., comprising first and 
second portions , the first portion comprising a sequence as 
described and the second portion comprising a detectable marker. 
Pharmaceutical compositions are made available, e.g., comprising a 
sterile polypeptide, as described, in a pharmaceutically 
acceptable carrier. 

Polynucleotide embodiments include an isolated or recombinant 
polynucleotide encoding a described polypeptide. Preferred forms 
will be such a polynucleotide which: comprises the mature 
polypeptide coding portion of SEQ ID NO: 1; or encodes the mature 
SEQ ID NO: 2. Preferred embodiments include wherein the 
polynucleotide is: a PCR product; a hybridization probe; a 
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mutagenesis primer; or made by chemical synthesis. Alternatively, 
the polynucleotide is; detectably labeled; a deoxyribonucleic 
acid; or double stranded. Also provided is an expression vector: 
comprising the described polynucleotide, including wherein the 
polypeptide specifically binds polyclonal antibodies generated 
against an inanunogen of mature SEQ id NO: 2; which selectively 
hybridizes under stringent hybridization conditions to a target 
polynucleotide sequence having at least 60 contiguous nucleotides 
from SEQ ID NO: 1; encodes a polypeptide having at least 50 
contiguous amino acid residues from mature SEQ ID NO: 2; or is 
suitable for transfection into a prokaryote or eukaryote host 
cell. Preferably, the host cell is: a mammalian cell; a bacterial 
cell; an insect cell; a prokaryote; a eukaryote; or a COS cell. A 
method is provided, e.g. , of making a polypeptide comprising 
expressing the vector in the host cell. 

Other polynucleotides include an isolated or recombinant 
polynucleotide which hybridizes to the coding portion of SEQ ID 
NO: 1 under stringent hybridization and wash conditions of at 
least 50" C, a salt concentration of less than 400 mM, and 50% 
formamide. Such a nucleic acid may be an expression vector, which 
may hybridize to the coding portion of SEQ ID NO: 1 under 
stringent hybridization and wash conditions of at least 60' C, a 
salt concentration of less than 200 mM, and 50% formamide. 
Preferably, the vector encodes a polypeptide which specifically 
binds an antibody generated against a mature SEQ ID NO: 2. 
Another embodiment will be such a polynucleotide which hybridizes 
to SEQ ID NO: 1, wherein the polynucleotide is: a PCR product; a 
hybridization probe; a mutagenesis primer; or made by chemical 
synthesis . 

Methods are provided, e.g., of modulating the physiology or 
development of a cell, comprising contacting the cell with an 
agonist or antagonist of a described polypeptide; of detecting the 
presence of a complementary polynucleotide in a sample, comprising 
contacting a described polynucleotide that selectively hybridizes 
with the complementary polynucleotide in the sample to form a 
detectable duplex; thereby indicating the presence of the 
polynucleotide in the sample; or for identifying a compound that 
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binds to a described polypeptide, comprising: incubating 
components comprising the compound and the polypeptide under 
conditions sufficient to allow the components to interact; and 
measuring the binding of the compound to the polypeptide. 

in TNF receptor-like embodiments, the invention further 
provides an isolated or recombinant polynucleotide encoding an 
antigenic polypeptide comprising at least 17 contiguous amino 
acids from: the mature polypeptide from SEQ ID NO: 6; the mature 
polypeptide from SEQ ID NO : 8 ; . the mature polypeptide from SEQ ID 
NO: 10; the mature polypeptide from SEQ ID NO: 12; the mature 
polypeptide from SEQ ID NO: 17; the mature polypeptide from SEQ ID 
NO: 19; the mature polypeptide from SEQ ID NO: 21; or the mature 
polypeptide from SEQ ID NO: 23. In preferred embodiments, such 
polynucleotide will encode all of the polypeptide of: signal 
processed SEQ ID NO: 6; signal processed SEQ ID NO: 8; signal 
processed SEQ ID NO: 10; signal processed SEQ ID NO: 12; signal 
processed SEQ ID NO: 17; SEQ ID NO: 19; SEQ ID NO: 21; or SEQ ID 
NO: 23. Other embodiments include such a polynucleotide, which 
hybridizes at 55° C, less than 500 mM salt, and 50% formamide to 
the: mature protein coding portion of SEQ ID NO: 5; signal 
processed coding portion of SEQ ID NO: 7; signal processed coding 
portion of SEQ ID NO: 9; signal processed coding portion of SEQ ID 
NO: 11; mature protein coding portion of SEQ ID NO: 16; 
polypeptide coding portion of SEQ ID NO: 18; polypeptide coding 
portion of SEQ ID NO: 20; or polypeptide coding portion of SEQ ID 
NO: 22. Other forms include those polynucleotides, comprising at 
least 35 contiguous nucleotides of: mature protein coding portion 
of SEQ ID NO: 5; signal processed coding portion of SEQ ID NO: 7; 
signal processed coding portion of SEQ ID NO: 9; signal processed 
coding portion of SEQ ID NO: 11; mature protein coding portion of 
SEQ ID NO: 16; polypeptide coding portion of SEQ ID NO: 18; 
polypeptide coding portion of SEQ ID NO: 20; or polypeptide coding 
portion of SEQ ID NO: 22. Various expression vectors are provided 
comprising such a polynucleotide. The invention also provides a 
host cell containing the expression vector, including a eukaryotic 
cell. 
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Methods are provided, e.g., making an antigenic polypeptide 
comprising expressing a recombinant polynucleotide; for detecting 
a polynucleotide, comprising contacting the polynucleotide with a 
probe that hybridizes, under stringent conditions, to az least 25 
contiguous nucleotides of the: mature protein coding portion of 
SEQ ID NO: 5; signal processed coding portion of SEQ ID NO: 7; 
signal processed coding portion of SEQ ID NO: 9; signal processed 
coding portion of SEQ ID NO: 11,- mature protein coding portion of 
SEQ ID NO: 16; polypeptide coding portion of SEQ ID NO: 18; 
polypeptide coding portion of SEQ ID NO: 20; or polypeptide coding 
portion of SEQ ID NO: 22; to form a duplex, wherein detection of 
the duplex indicates the presence of the polynucleotide. Kits are 
provided, e.g., for the detection of a described polynucleotide, 
comprising a compartment containing a probe that hybridizes, under 
stringent hybridization conditions, to at least 17 contiguous 
nucleotides of a described polynucleotide to form a duplex. 
Preferably, the probe is detectably labeled. 

Binding compounds are provided, including antibodies, 
comprising an antibody binding site which specifically binds to a 
polypeptide comprising at least 17 contiguous amino acids from: 
signal processed SEQ ID NO: 6; signal processed SEQ ID NO: 8; 
signal processed SEQ ID NO: 10; signal processed SEQ ID NO: 12; 
signal processed SEQ ID NO: 17; SEQ ID NO: 19; SEQ ID NO: 21; or 
SEQ ID NO: 23. Preferably, the antibody binding site is: 
selectively immunoreactive with the: signal processed SEQ ID NO: 
6; signal processed SEQ ID NO: 8; signal processed SEQ ID NO: 10; 
signal processed SEQ ID NO: 12; signal processed SEQ ID NO: 17; 
SEQ ID NO: 19; SEQ ID NO: 21; or SEQ ID NO: 23; raised against a 
purified or recombinantly produced human HDTEA.84 protein; raised 
against a purified or recombinantly produced human HSLJD37R 
protein; or in a monoclonal antibody. Fab, or F(ab)2,- or the 
binding compound is: an antibody molecule; a polyclonal antiserum; 
detectably labeled; sterile; or in a buffered composition. 

Such compositions allow various methods, including using the 
binding compound, comprising contacting the binding compound with 
a biological sample comprising an antigen, thereby forming a 
binding compound : antigen complex. Preferably, the biological 
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sample is from a human, and the binding compound is an antibody. 
Such also allow for production of a detection kit comprising the 
binding compound, and: instructional material for the use of the 
binding compound for the detection; or a compartment providing 
segregation of the binding compound. 

Polypeptides are also made available, e.g., a substantially 
pure or isolated antigenic polypeptide, which binds to the 
described binding composition, and further comprises at least 17 
contiguous amino acids from: signal processed SEQ ID NO: 6; signal 
processed SEQ ID NO: 8; signal processed SEQ ID NO: 10; signal 
processed SEQ ID NO: 12; signal processed SEQ ID NO: 17; SEQ ID 
NO: 19; SEQ ID NO: 21; or SEQ ID NO: 23. Preferred polypeptides 
include those which: comprise at least a fragment of at least 25 
contiguous amino acid residues from: a primate HDTEA84 protein; a 
primate HSLJD37R protein; or a rodent or primate RANKL protein; or 
are soluble polypeptides; are detectably labeled; are in a sterile 
composition; are in a buffered composition; bind to an sialic acid 
residue; are recombinantly produced; or have a naturally occurring 
polypeptide sequence. In other embodiments, the polypeptide 
comprises at least 17 contiguous amino acids from the: signal 
processed SEQ ID NO: 6; signal processed SEQ ID NO: 8; signal 
processed SEQ ID NO: 12; signal processed SEQ ID NO: 17; SEQ ID 
NO: 19; SEQ ID NO: 21; or SEQ ID NO: 23. 

Methods are provided, including a method of modulating a 
precursor cell physiology or function comprising a step of 
contacting the cell with: a binding compound which binds to a 
described polypeptide; an HDTEA84 polypeptide; an HSLJD37R 
polypeptide; or a RANKL polypeptide. The method may be one 
wherein the contacting is in combination with a TNF family ligand, 
or an antagonist of the TNF family ligand. 

In other embodiments, the present invention, provides 
compositions related to other chemokine. Dub, or surface protein 
genes. Polypeptide embodiments include: a substantially pure or 
recombinant HCC5 polypeptide exhibiting identity over a length of 
at least 12 amino acids to SEQ ID NO: 25; an isolated natural 
sequence HCC5 of mature SEQ ID NO: 25; a fusion protein comprising 
HCC5 sequence; a substantially pure or recombinant Dubll 
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polypeptide exhibiting identity over a length of at least about 12 
amino acids to SEQ ID NO: 32 or 34; an isolated natural sequence 
Dubll of mature SEQ ID NO: 32 or 34,- a fusion protein comprising 
Dubll sequence; a substantially pure or recombinant Dubl2 
polypeptide exhibiting identity over a length of at least about 12 
amino acids to SEQ ID NO: 3 6 or 38; an isolated natural sequence 
Dubl2 of mature SEQ ID NO: 3 6 or 38; a fusion protein comprising 
Dubl2 sequence; a substantially pure or recombinant MD-1 
polypeptide exhibiting identity over a length of at least about 12 
amino acids to SEQ ID NO: 42; an isolated natural sequence MD-1 of 
mature SEQ ID NO: 42; a fusion protein comprising primate MD-1 
sequence; a substantially pure or recombinant MD-2 polypeptide 
exhibiting identity over a length of at least about 12 amino acids 
to SEQ ID NO: 44 or 46; an isolated natural sequence MD-2 of 
mature SEQ ID NO: 44 or 46; a fusion protein comprising primate 
MD-2 sequence; a substantially pure or recombinant MD-2 
polypeptide exhibiting identity over a length of at least about 12 
amino acids to SEQ ID NO: 48 or 49; an isolated natural sequence 
MD-2 of mature SEQ ID NO: 48; or a fusion protein comprising 
murine MD-2 sequence. Preferred embodiments include substantially 
pure or isolated polypeptides which match the sequences over a 
stretch of at least 17 amino acids; more preferably over a stretch 
of at least 21 amino acids; over 25, 30, 35, 50, 75 or more. In 
other preferred embodiments, the HCC5 polypeptide: is from a 
primate, including a human; comprises at least one polypeptide 
segment of SEQ ID NO: 25; exhibits a plurality of portions 
exhibiting the identity; is a natural allelic variant of HCC5; has 
a length at least about 30 amino acids; exhibits at least two non- 
overlapping epitopes which are specific for a primate HCC5; 
exhibits a sequence identity over a length of at least 35 amino 
acids to a HCC5; is glycosylated; is a synthetic polypeptide; is 
attached to a solid substrate; is conjugated to another chemical 
moiety; is a 5-fold or less substitution from natural sequence; or 
is a deletion or insertion variant from a natural sequence; or the 
Dubll polypeptide: is from a primate, including a human; comprises 
at least one polypeptide segment of SEQ ID NO: 32 or 34; exhibits 
a plurality of portions exhibiting the identity; is a natural 
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allelic variant of Dubll; has a length at least about 30 amino 
acids; exhibits at least two non-overlapping epitopes which are 
specific for a primate Dubll; exhibits a sequence identity over a 

10 length of at least about 35 amino acids to a Dubll; is 

5 glycosylated; is a synthetic polypeptide; is attached to a solid 
substrate; is conjugated to another chemical moiety; is a 5-fold 
or less substitution from natural sequence; or is a deletion or 

15 insertion variant from a natural sequence; or the Dubl2 

polypeptide: is from a primate,, including a human; comprises at 
10 least one polypeptide segment of SEQ ID NO: 36 or 38; exhibits a 
plurality of portions exhibiting the identity; is a natural 

20 allelic variant of Dubl2; has a length at least about 30 amino 

acids; exhibits at least two non-overlapping epitopes which are 
specific for a primate Dubl2; exhibits a sequence identity over a 
15 length of at least about 35 amino acids to a Dubl2; is 

25 glycosylated; is a synthetic polypeptide; is attached to a solid 

substrate; is conjugated to another chemical moiety; is a 5-fold 
or less substitution from natural sequence; or is a deletion or 
insertion variant from a natural sequence; or the primate MD-1 
20 polypeptide: is from a human; comprises at least one polypeptide 
segment of SEQ ID NO: 42; exhibits a plurality of portions 
exhibiting the identity; is a natural allelic variant of primate 
MD-1; has a length at least about 30 amino acids; exhibits at 
least two non-overlapping epitopes which are specific for a 

35 

25 primate MD-1; exhibits a sequence identity over a length of at 

least about 35 amino acids to a primate MD-1; is glycosylated; is 
a synthetic polypeptide; is attached to a solid substrate; is 
conjugated to another chemical moiety; is a 5-fold or less 
substitution from natural sequence; or is a deletion or insertion 
30 variant from a natural sequence; or the primate MD-2 polypeptide: 
is from a human; comprises at least one polypeptide segment of SEQ 
ID NO: 44 or 46; exhibits a plurality of portions exhibiting the 
45 identity; is a natural allelic variant of primate MD-2; has a 

length at least about 30 amino acids; exhibits at least two non- 
35 overlapping epitopes which are specific for a primate MD-2; 

exhibits a sequence identity over a length of at least about 35 
50 amino acids to a primate MD-2; is glycosylated; is a synthetic 
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polypeptide; is attached to a solid substrate; is conjugated to 
another chemical moiety; is a 5-fold or less substitution from 
natural sequence; or is a deletion or insertion variant from a 
natural sequence; or the rodent MD-2 polypeptide: is from a mouse; 
comprises at least one polypeptide segment of SEQ ID NO: 48 or 49; 
exhibits a plurality of portions exhibiting the identity; is a 
natural allelic variant of rodent MD-2; has a length at least 
about 30 amino acids; exhibits at least two non-overlapping 
epitopes which are specific for. a rodent MD-2; exhibits a sequence 
identity over a length of at least about 35 amino acids to a 
rodent MD-2; is glycosylated; is a synthetic polypeptide; is 
attached to a solid substrate; is conjugated to another chemical 
moiety; is a 5-fold or less substitution from natural sequence; or 
is a deletion or insertion variant from a natural sequence. 
Sterile compositions comprising such polypeptides are also 
provided, along with those comprising: the HCC5 polypeptide and: a 
carrier, wherein the carrier is: an aqueous compound, including 
water, saline, and/or buffer; and/or formulated for oral, rectal, 
nasal, topical, or parenteral administration,- another chemokine, 
including one selected from the group of HCC1, HCC2, HCC3 , and 
HCC4; or an antibody antagonist for a chemokine, including one 
selected from the group of HCC1, HCC2, HCC3 , and HCC4 ; the Dubll 
polypeptide and a carrier, wherein the carrier is: an aqueous 
compound, including water, saline, and/or buffer; and/or 
formulated for oral, rectal, nasal, topical, or parenteral 
administration; the Dubl2 polypeptide and a carrier, wherein the 
carrier is: an aqueous compound, including water, saline, and/or 
buffer; and/or formulated for oral, rectal, nasal, topical, or 
parenteral administration; the MD-1 polypeptide and a carrier, 
wherein the carrier is: an aqueous compound, including water, 
saline, and/or buffer; and/or formulated for oral, rectal, nasal, 
topical, or parenteral administration,- the MD-2 polypeptide and a 
carrier, wherein the carrier is: an aqueous compound, including 
water, saline, and/or buffer; and/or formulated for oral, rectal, 
nasal, topical, or parenteral administration. 

Fusion proteins are provided, e.g., comprising: mature 
protein sequence of SEQ ID NO: 25; mature protein sequence of SEQ 
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ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36 or SEQ ID NO: 38; mature 
protein sequence of SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 46, 
SEQ ID NO: 48, or SEQ ID NO: 49; a detection or purification tag, 
including a FLAG, His6, or Ig sequence; or sequence of another 
chemokine protein with the chemokine polypeptide Kits are 
provided, e.g., comprising a described polypeptide and: a 
compartment comprising the polypeptide; and/or instructions for 
use or disposal of reagents in the kit. 

Binding compounds, including antibodies, are provided, e.g., 
comprising an antigen binding portion from an antibody, which 
specifically binds to a natural: HCC5 polypeptide, wherein the 
antibody: is raised against a peptide sequence of a mature HCC5 
polypeptide sequence of SEQ ID NO: 25; is raised against a mature 
HCC5; is raised to a purified HCC5 ; is immunoselected; is a 
polyclonal antibody; binds to a denatured HCC5; or exhibits a Kd 
to HCC5 antigen of at least 30 uM; or Dubll polypeptide, wherein 
the antibody: is raised against a peptide sequence of a mature 
Dubll polypeptide sequence of SEQ ID NO: 3 2 or SEQ ID NO: 34; is 
raised against a mature Dubll; is raised to a purified Dubll; is 
immunoselected; is a polyclonal antibody; binds to a denatured 
Dubll; or exhibits a Kd to Dubll antigen of at least 30 JIM; or 

Dubl2 polypeptide, wherein the antibody: is raised against a 
peptide sequence of a mature Dubl2 polypeptide sequence of SEQ id 
NO: 36 or SEQ ID NO: 38; is raised against a mature Dubl2 ; is 
raised to a purified Dubl2; is immunoselected; is a polyclonal 
antibody; binds to a denatured Dubl2; or exhibits a Kd to Dubl2 
antigen of at least 30 UM; or a primate MD-1 polypeptide, wherein 
the antibody: is raised against a peptide sequence of a mature 
polypeptide sequence of SEQ ID NO: 42; is raised against a mature 
MD-1; is raised to a purified MD-1; is immunoselected; is a 
polyclonal antibody; binds to a denatured MD-1; or exhibits a Kd 
to MD-1 antigen of at least 30 UM; or a primate MD-2 polypeptide, 

wherein the antibody: is raised against a peptide sequence of a 
mature MD-2 polypeptide sequence of SEQ ID NO: 44, or SEQ ID NO: 
46; is raised against a mature MD-2; is raised to a purified MD-2; 
is immunoselected; is a polyclonal antibody; binds to a denatured 
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MD-2; or exhibits a Kd to MD-2 antigen of at least 30 UM; or a 
rodent MD-2 polypeptide, wherein the antibody: is raised against a 
peptide sequence of a mature MD-2 polypeptide sequence of SEQ ID 
NO: 48, or SEQ ID NO: 49; is raised against a mature rodent MD-2; 
is raised to a purified rodent MD-2; is immunoselected; is a 
polyclonal antibody; binds to a denatured rodent MD-2; or exhibits 
a Kd to antigen of at least 30 fiM. in certain embodiments, the 
binding composition will be one wherein: the polypeptide is from a 
primate or rodent; the binding compound is an Fv, Fab, or Fab2 
fragment; the binding compound is conjugated to another chemical 
moiety; is attached to a solid substrate, including a bead or 
plastic membrane; is in a sterile composition; or is detectably 
labeled, including a radioactive or fluorescent label. 

Kits are provided comprising the binding compound, and: a 
compartment comprising the binding compound; a compartment 
comprising purified antigen; and/or instructions for use or 
disposal of reagents in the kit. Methods are provided for 
producing an antigen : antibody complex, comprising contacting an 
antibody and: a primate HCC5 polypeptide; a primate Dubll 
polypeptide; a primate Dubl2 polypeptide,- a primate MD-1 
polypeptide; a primate MD-2 polypeptide; or a rodent MD-2 
polypeptide; thereby allowing the complex to form, other 
compositions are provided, e.g., the binding compound and: a 
carrier, wherein the carrier is: an aqueous compound, including 
water, saline, and/or buffer; and/or formulated for oral, rectal, 
nasal, topical, or parenteral administration; or an antibody 
antagonist for another chemokine, including one selected from the 
group of HCC1, HCC2, HCC3, and HCC4. 

Nucleic acid embodiments include, e.g., an isolated or 
recombinant nucleic acid encoding a polypeptide or fusion protein 
described, wherein: the HCC5: polypeptide is from a primate, 
including a human; or nucleic acid: encodes an antigenic HCC5 
peptide sequence of SEQ ID NO: 25 encodes a plurality of antigenic 
HCC5 peptide sequences of SEQ ID NO: 25; exhibits identity over at 
least 25 nucleotides to a natural cDNA encoding the HCC5 segment; 
is a hybridization probe for a gene encoding the HCC5 polypeptide; 
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or further encodes another chemokine, including one selected from 
the group of HCC1, HCC2. HCC3. and HCC4; or the Dubll: polypeptide 
is from a primate, including a human; or nucleic acid: encodes a 
Dubll antigenic peptide sequence of SEQ ID NO: 32; or SEQ ID NO: 
34 ; encodes a plurality of antigenic peptide sequences of SEQ ID 
NO: 32 or SEQ ID NO: 34; exhibits identity over at least 25 
nucleotides to a natural cDNA encoding the Dubll segment; or is a 
hybridization probe for a gene encoding the Dubll polypeptide; the 
Dubl2: polypeptide is from a primate, including a human; or 
nucleic acid: encodes an antigenic Dubl 2 peptide sequence of SEQ 
ID NO: 36 or SEQ ID NO: 38; encodes a plurality of antigenic 
peptide sequences of SEQ ID NO: 3 6 or SEQ ID NO: 38; exhibits 
identity over at least 25 nucleotides to a natural cDNA encoding 
the DUB12 segment; is a hybridization probe for a gene encoding 
the Dubl2 polypeptide; or the primate MD-1: polypeptide is from a 
primate, including a human; or nucleic acid: encodes an antigenic 
MD-1 peptide sequence of SEQ ID NO: 42; encodes a plurality of 
antigenic peptide sequences of SEQ ID NO: 42; exhibits identity 
over at least 25 nucleotides to a natural cDNA encoding the MD-1 
segment; is a hybridization probe for a gene encoding the Dubll 
polypeptide; or the primate MD-2 : polypeptide is from a human,- or 
nucleic acid: encodes an antigenic MD-2 peptide sequence of SEQ ID 
NO: 44, or SEQ ID NO: 46; encodes a plurality of antigenic peptide 
sequences of SEQ ID NO: 44, or SEQ ID NO: 46; exhibits identity 
over at least 25 nucleotides to a natural cDNA encoding the 
segment; is a hybridization probe for a gene encoding the primate 
MD-2 polypeptide; or the rodent MD-2: polypeptide is from a mouse; 
or nucleic acid: encodes an antigenic MD-2 peptide sequence of SEQ 
ID NO: 48, or SEQ ID NO: 49; encodes a plurality of antigenic 
peptide sequences of SEQ id NO: 48, or SEQ ID NO: 49; exhibits 
identity over at least 25 nucleotides to a natural cDNA encoding 
the MD-2 segment; or is a hybridization probe for a gene encoding 
the rodent MD-2 polypeptide. Other nucleic acid embodiments 
include the described, which: is an expression vector; further 
comprises an origin of replication; is from a natural source; 
comprises a detectable label; comprises synthetic nucleotide' 
sequence is less than 6 kb, preferably less than 3 kb; is from a 
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primate, including a human; comprises a natural full length coding 
sequence; or is a PCR primer, PCR product, or mutagenesis primer. 

Various cells are provided, including a cell or tissue 
comprising a described recombinant nucleic acid, including wherein 
^ the cell is: a prokaryotic cell; a eukaryotic cell; a bacterial 
cell; a yeast cell; an insect cell; a mammalian cell; a mouse 
cell,- a primate cell; or a human cell. 
15 Kits are Provided, e.g., comprising a described nucleic acid, 

and: a compartment comprising the nucleic acid; a compartment 
10 comprising a nucleic acid encoding another chemokine, including 

HCC1, HCC2. HCC3, and HCC4; or instructions for use or disposal of 
20 reagents in the kit. 

Alternative nucleic acids include those which: hybridize 
under wash conditions of 45° C and less than 2M salt to the 
polypeptide coding portion of SEQ ID NO: 24; hybridize under wash 
conditions of 45° C and less than 2M salt to the polypeptide 
coding portions of SEQ ID NO: 31 or 33; hybridize under wash 
conditions of 45° C and less than 2M salt to the coding portions 
of SEQ ID NO: 35 or 37; hybridize under wash conditions of 45° C 
and less than 2M salt to the coding portion of SEQ ID NO: 41 ; 
hybridize under wash conditions of 45° C and less than 2M salt to 
the coding portion of SEQ ID NO: 43 or 45. or hybridize under wash 
conditions of 45° C and less than 2M salt to the coding oortion of 
35 SEQ 10 N0: 47 • P^ferably, the wash conditions are at 55° C 

25 and/or 500 mM salt; or at 65° C and/or 150 mM salt. 

Additionally, methods are provided, e.g., of modulating 
physiology or development of a cell or tissue culture cells 
comprising exposing the cell to an agonist or antagonist of HCC5, 
primate MD-1, primate MD-2, or rodent MD-2 . Others include 
30 methods of detecting specific binding to a compound, comprising: 
contacting the compound to a composition selected from the group 
of: an antigen binding site which specifically binds to: an HCC5 
chemokine; a Dubll; a Dubl2; a primate MD-1; a primate MD-2; a 
rodent MD-2; or an expression vector encoding: an HCC5 chemokine 
35 or fragment thereof; a Dubll or fragment thereof; a Dubl2 or 

fragment thereof; a primate MD-1 or fragment thereof; a primate 
MD-2 or fragment thereof; or a rodent MD-2 or fragment thereof; a 
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substantially pure protein which is specifically recognized by the 
antigen binding site of the described antigen binding sites; a 
substantially pure HCC5 chemokine or peptide thereof, or a fusion 
protein comprising a 30 amino acid sequence portion of HCC5 
chemokine sequence; a substantially pure Dubll or peptide thereof, 
or a fusion protein comprising a 30 amino acid sequence portion of 
Dubll sequence; a substantially pure Dubl2 or peptide thereof, or 
a fusion protein comprising a 30 amino acid sequence portion of 
Dubll sequence; a substantially pure primate MD-1 or peptide 
thereof, or a fusion protein comprising a 30 amino acid sequence 
portion of primate MD-1 sequence,- a substantially pure primate MD- 
2 or peptide thereof, or a fusion protein comprising a 30 amino 
acid sequence portion of primate MD-2 sequence; a substantially 
pure rodent MD-2 or peptide thereof, or a fusion protein 
15 comprising a 3 0 amino acid sequence portion of rodent MD-2 
sequence; and then detecting binding of the compound to the 
composition. 

Particular polynucleotide embodiments include an isolated or 
recombinant polynucleotide which: encodes at least 17 contiguous 
amino acid residues of S2Q ID NO: 54; encodes at least two 
distinct segments of at least 10 contiguous amino acid residues of 
SEQ ID NO 54; or comprises one or more segments at least 21 
contiguous nucleotides of SEQ ID NO: 53. Such polynucleotides 
allow methods of making: a polypeptide comprising expressing a 
25 described expression vector, thereby producing the polypeptide; a 
duplex nucleic acid comprising contacting a polynucleotide with a 
complementary nucleic acid, thereby resulting in production of the 
duplex nucleic acid; a synthetic polynucleotide, comprising 
chemically polymerizing nucleotides to produce the polynucleotide ,- 
or a polynucleotide comprising using a PCR method. 

Cyclin polypeptide embodiments include an isolated or 
recombinant antigenic polypeptide comprising at least: one segment 
comprising at least 17 contiguous amino acids from SEQ id NO: 54; 
or at least two distinct segments of at least 11 contiguous amino 
35 acids from SEQ ID NO: 54. Such polypeptide may: comprise at least 
one segment comprising at least 17 contiguous amino acids from SEQ 
ID NO: 54; and exhibit at least two non-overlapping epitopes which 
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are selective for primate protein of SEQ ID NO: 54. Other 
embodiments include those wherein the polypeptide: is a 5- fold or 
less substitution from a natural sequence; is a deletion or 
insertion variant from a natural sequence; or comprises at least 
two distinct segments of at least 11 contiguous amino acids from 
SEQ ID NO: 54. Preferably the polypeptide is antigenic, and will 
typically comprise at least one sequence from (SEQ ID NO: 54) 
KESRYVHD (residues 120-127), DKHFEVLH (residues 127-134), HSDLEPQM 
(residues 134-141), QKDINKNM (residues 177-184), YAPKLQEF 
(residues 203-210), SEEDILHM (residues 219-226), LRMELIIL 
(residues 224-231), ELCPVTII (residues 237-244), and LFLQVDAL 
(residues 249-256); and/or the segments of at least 11 contiguous 
amino acids comprise one the segment with at least 14 contiguous 
amino acids, from SEQ ID NO: 54. Such polypeptides may further 
exhibit at least two non-overlapping epitopes which are selective 
for primate protein of SEQ ID NO: 54; and/or may: comprise a 
mature sequence of SEQ ID NO: 2; bind with selectivity to an 
antibody generated against an immunogen of SEQ ID NO: 54; comprise 
a plurality of polypeptide segments of 17 contiguous amino acids 
of SEQ ID NO: 54; or be a natural allelic variant of SEQ ID NO: 
54. The polypeptide may: be in a sterile composition; have a 
length at least 30 amino acids; be not glycosylated; be denatured; 
be a synthetic polypeptide; be attached to a solid substrate; or 
be a fusion protein with a detection or purification tag, 
including a FLAG, His6, or Ig sequence. Other embodiments include 
those wherein the polypeptide: is a 5-fold or less substitution 
from a natural sequence; or is a deletion or insertion variant 
from a natural sequence. 

Various kits are provided, e.g., which comprise such 
polypeptides and instructions for the use or disposal of the 
polypeptide or other reagents of the kit. 

Methods are provided, e.g., to label the polypeptide, 
comprising labeling the polypeptide with a radioactive label; to 
separate the polypeptide from another polypeptide in a mixture, 
comprising running the mixture on a chromatography matrix, thereby 
separating the polypeptides; to identify a compound that binds 
selectively to the polypeptide, comprising incubating the compound 
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with the polypeptide under appropriate conditions; thereby causing 
the component to bind to the polypeptide; to conjugate the 
polypeptide to a matrix, comprising derivatizing the polypeptide 
with a reactive reagent, and conjugating the polypeptide to the 
5 matrix; or inducing an antibody response to the polypeptide, 

comprising introducing the polypeptide as an antigen to an immune 
system, thereby inducing the response. 
15 Binding compounds are provided, e.g., antibodies, comprising 

an antigen binding portion from an antibody which binds with 
10 selectivity to described polypeptides . Methods are made available 
for evaluating the selectivity of binding of a compound to cyclin 
20 E2- comprising contacting the compound to a recombinant cyclin E2 

polypeptide and at least one other cyclin; and comparing binding 
of the compound to the eye 1 ins. 



15 

25 



30 



35 



40 



45 



50 



55 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



10 

5 



24 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

All references cited herein are incorporated herein by 
reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated to 
be incorporated by reference. 



10 



20 



25 



15 I . General 

It is to be understood that this invention is not limited to 
the particular compositions, methods, and techniques described 
herein, as such compositions, methods, and techniques may vary. 
It is to be understood that the terminology used herein is for the 
purpose of describing particular embodiments, and is not intended 
to limit the scope of the present invention which is to be limited 
15 by the appended claims. 

As used herein, including the appended claims, singular forms 
of words such as "a, » "an," and "the" include their corresponding 
plural referents unless the context clearly dictates otherwise. 
Thus, e.g., reference to "a polynucleotide" includes one or more 
30 20 diff erent polynucleotides, reference to "a coirposition" includes 

one or more of such compositions, and reference to "a method" 
includes reference to equivalent steps and methods known to a 
person of ordinary skill in the art, and so forth. 

Unless otherwise defined, technical and scientific terms used 
25 herein have the same meaning as commonly understood by a person of 
ordinary skill in the art to which this invention belongs. 
Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the 
present invention, suitable methods and materials are described 
30 below. All publications, patent applications, patents, and other 
references discussed above are provided for their disclosure prior 
to the filing date of the present application. Nothing herein is 
to be construed as an admission that the invention is not entitled 
to antedate any such disclosure by virtue of its prior invention. 
35 The present invention also provides amino acid sequences and 

DNA sequences encoding various mammalian proteins, e.g., which are 
polypeptides produced by selected cells. Among these proteins are 
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those which: mediate uptake of substrates, e.g.. prostaglandin- 
like molecules, modulate or mediate, e.g., induce or prevent 
trafficking, proliferation, or differentiation of, interacting 
cells, or are intracellular proteins which are important in 
various cellular processes, e.g., deubiquitination of proteins or 
cell cycle regulation. 

The Prostaglandin-like Transporter (PGT) of the present 
invention is expressed particularly in antigen presenting cells of 
the immune system, e.g., dendritic cells. As such, the 
transporter is designated a dendritic cell prostaglandin-like 
transporter (DC-PGT) . Consequently, the DC- PGT of the present 
invention offers the means to establish fundamental understanding 
on the role of PG influence on immune function. 

The present invention provides DNA sequence encoding a 
mammalian protein that exhibits structural features characteristic 
25 of functionally significant proteins, particularly which serve as 

organic anion transporters. This family of organic anion 
transporters includes: the prostaglandin transporters of man (Lu, 
et al. (1996) J, Clin. Invest 98:1142-1149) and rat; organic 
anion transporters in man and rat; brain digoxyin transporters and 
Matrin F/G of rat (Kanai, et al . (1995) Science 268:866-869). 

Transporters of this family typically are 12 transmembrane 
proteins of approximately 650 amino acids in length. 
Characteristic of this group of proteins is a cysteine rich region 
located in one of the extracellular loops, which resembles a zinc 
finger motif. It is not entirely certain whether these 
polypeptides mediate primarily the influx or efflux of 
prostaglandins and organic anions, and they may, under different 
circumstances produce influx or efflux depending, e.g., on the 
intracellular concentration of the organic anions concerned. 

The DC PGT protein of the present invention is closest in 
homology to the prostaglandin transporters and it is probable that 
a prostaglandin is the major anion transported. The human gene 
embodiment described herein, isolated as designate DC-PGT or clone 
240, contains an open reading frame encoding a presumptive protein 
of about 709 amino acids. This protein exhibits intracellular, 
transmembrane, and extracellular protein segments, revealing novel 
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aspects of organic anion transport that may be relevant during 
mamnalxan development, e.g., development of dendritic cells of the 
immune system. 

The introduction of evolutionary information in the form of 
5 sequence homologs simplifies the structural analysis considerably 
for related molecules which share a common structural framework in 
sprte of considerable sequence divergence, see, e.g., chothia and 
Lesk (1986) EMBO^ 5:823-826. This concept can be effectively 
extended to the strong prediction of TM regions across an aligned 

10 protein family, whereas any single sequence may provide an 

uncertain topology. See Persson and Argos (1994) J. MoT . mm 
237:182-192; and Host, et al. (1995) Protein Bri 4:521-533. For 
the DC PGT, a number of sequence homologs were first asserrtoled by 
comparative matching to protein and translated nucleotide 

15 databases (Altschul, et al . (1994) Nature Gm* 6:119-129; 

Koonin, et al . (1994) SJBO^L 13:493-503). These relatives of DC- 
PGT include a ubiquitously expressed PGT from primate, e.g., human 
(GenBank.- locus HSU70867, accession U70867) , and a PGT from 
rodent, e.g., rat (prostaglandin transporter - rat, GenBank Acc 

0 No. 1083766; Kanai, et ai . (1995) Science 268:866-869). These 
sequences were subjected to parallel analyses by a suite of 
computer programs that have greatly improved on the initial Kyte 
and Doolittle (1982, hydropathic profile as a means of predicting 
the topology of integral membrane proteins. Four algorithms 

5 (ALOM, MTOP, MEMSAT and TopPredll) (Klein, et al . (1985) Biochim. 
Biophys. Acta 815:468-476; Harunann, et al. (1989) Proc. n*mi 
Acad. sci. U.Sfl 86:5786-5790; Jones, et al. (1994) Biorb^ 
33:3038-3049; and Claros and von Heijne (1994) Como. AnnUr 
fiiosgjU 10:685-686) were used to individually predict TM 

) extensions and orientations; these predictions were pooled and 
mapped onto the multiple sequence alignment produced by ClustalW 
and MACAW (Thompson, et al. (1994) Nucl. Arir... ^ 22:4673-4680; 
and Schuler, et al. (1991) Pj^eins 9:180-190). Furthermore, 
these multiply aligned sequence files were used as input to PHD 

1 and TMAP (Rost, et al. (1995) P rotein S cU 4:521-533; Persson and 
Argos (1994) J. ml. Bio1 237:182-192) for a familial prediction 
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of shared TO regions. Structural features that persisted in this 
two-step analysis are likely to be shared topological traits 
present in all members of this organic anion transporter family. 

HDTEA84, HSLJD37R, and RANKL genes and proteins are also 
provided, which are related to the TNF signaling pathways. The 
antigens HDTEA84, HSLJD37R, and RANKL, and fragments, or 
antagonists will be useful in physiological modulation of cells 
expressing receptors for, e.g., ligands of the TNF family. Some 
of these antigens appear to lack a membrane spanning segment, 
suggesting that they are soluble forms of receptor. This suggests 
that the soluble proteins can serve as antagonists of the TNF-like 
ligands. In addition, it is likely that membrane spanning forms 
exist, which serve as signaling receptors mediating cellular 
response to the ligands. 

The HDTEA84 gene has been detected in cDNA libraries derived 
form Hodgkin's lymphoma, endothelial cells, keratinocytes , 
prostrate, and cerebellum. It exhibits significant sequence 
similarity to the osteoprotegerin ligand receptor reported by 
Lacey, et al. (1998) Cell 93:165-176. The HDTEA84 will likely 
modulate proliferation or development by antagonizing i~s 
respective ligand. Membrane associated forms should exist, likely 
alternatively spliced transcription products. 

The HSLJD37R exhibits like similarity to receptors for TNF. 
While the first embodiment is an incomplete sequence, the 
available portion currently lacks an identified transmembrane 
segment. Additional efforts provide a full length sequence, and 
an alternative splice variant. 

The rodent 427152#4 Rank- like (RANKL) was detected in a 
rodent cDNA library panel probed with Mouse 427152#4 (204 bp) . 
Positive signals were detected in CK12 (B cell line) ; rag-1 
thymus; rag-1 heart; rag-1 brain (best signal); rag-1 testes; rag- 
1 liver; normal lung,- rag-1 lung; asthmatic lung; tolerized and 
challenged lung; Nippo- infected lung; Nippo IL-4 K.O. lung; Nippo 
anti-IL-5 treated lung; influenza lung; guinea pig allergic lung; 
w.t. stomach; and w.t. colon on a 3 day exposure at -80° C with an 
intsnsifier screen. On a 2 week exposure at -80° C with screen, 
signals were also detected in the following libraries: Mell4+ 
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naive; Mell4+ Thl; Mell4+ Th2; Thl 3 week Bl/6; large B cell; 
bEnd3 + TNFcc + IL-10, guinea pig normal lung; and Rag Hh- colon. 

The primate, e.g., human. Rank-like (RANKL) homologs of 
rodent 427152#4 were detected in a human cDNA library panel probed 
with mouse 427152#4 (204 bp) . Signals were detected in monkey 
asthma lung 4 h (1.6-2.0 kb) and adult placenta (2.5-3.0 kb) on a 
3 day exposure at -80° C with screen. On a 2 week exposure at - 
80° C with screen, signals were also detected in the following 
libraries: CDla+ 95% DC activated CHA (kidney epithelial carcinoma 
cell line); monkey lung normal; psoriasis skin; fetal lung; fetal 
ovary; fetal testes; and fetal spleen. 

Each of these proteins will also be useful as antigens, e.g., 
immunogens, for raising antibodies to various epitopes on the 
protein, linear and/or conformational epitopes. The molecules may 
be useful in defining various cell subsets, either by the 
molecules produced by, or by expression of membrane forms of the 
receptors. Such cells should be responsive to the respective 
ligands. Soluble forms of the receptors should serve as 
antagonists of the ligand, binding to the ligand and preventing 
interaction with membrane forms, which would mediate signaling. 

Both genes express proteins which exhibit structural motifs 
characteristic of a member of the TNF receptor family. SEQ ID NO: 5 
and SEQ ID NO: 6, respectively, provide the nucleic acid and 
predicted amino acid sequences for primate, e.g., human, HDTEA84. 
SEQ ID NO: 7 and SEQ ID NO: 8, respectively, provide the nucleic 
acid and predicted amino acid sequences for primate, e.g., human, 
HSLJD37R. 

Interesting features of the HDTEA84 include: signal sequence 
from about 1-11; TNF receptor Cys rich domains I (about 32-72), II 
(about 73-113), III (about 114-150), and IV (about 151-193); and 
unique region from about 194-300. Features for the HSLJD37R (SEQ 
ID NO: 10 form), partly based on alignment with HDTEA84: signal 
sequence from about 1-41; TNF recepcor Cys rich domains I (about 
42-90), II (about 91-131), III (about 132-168), and IV (about 169- 
211); transmembrane segment from about 354-370. Similar alignment 
of the other variants will identify similar features. Segments 
including combinations or excluding such segments may be desired. 
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The structural homology of HDTEA84, HSLJD37R, and RANKL to 
members of the TNF receptor family suggests related function of 
these molecules. See, e.g., Lacey, et al. (1998) Cell 93:165-176. 
The sequences, however, mostly lack a transmembrane segment, 
5 suggesting that the proteins are soluble receptor forms. They may 
well also have membrane bound forms resulting, e.g., from 
alternatively spliced transcript variants. The soluble forms are 
likely to be antagonists of the ligand, e.g., blocking the binding 
of ligand to a membrane bound form of signaling receptor. Thus, 
10 these molecules may be useful in the treatment of abnormal immune 
or developmental disorders. 

The natural antigens should be capable of modulating various 
biochemical responses which lead to biological or physiological 
responses in target cells. The embodiments characterized herein 
15 are from primate, e.g., human, but other species variants almost 
25 surely exist, e.g., rodents, etc. See below. The descriptions 

below are directed, for exemplary purposes, to primate HDTEA84, 
HSLOD37R, or RANKL, but are likewise applicable to related 
embodiments from other species . 
30 20 The HDTEA84, HSLJD37R, and RANKL clones were assembled 

through the careful analysis of ESTs present in various databases, 
e.g., Merck-WashU public database. These genes exhibit structural 
motifs characteristic of a member of the TNF receptor family. 
Compare, e.g., with the TNF receptor, NGF-receptor , and FAS 
25 receptor. Table 3 discloses the nucleic acid and predicted amino 
acid sequences for primate, e.g., human, HDTEA84. The ESTs were 
identified from several different libraries. 

SEQ ID NO: 7 AND SEQ ID NO: 8, respectively, disclose partial 
nucleic acid and predicted amino acid sequences for primate, e.g., 
human, HSLJD37R. The ESTs were identified from several different 
libraries derived from: smooth muscle, pancreas tumor, adipocytes, 
HUVEC cells, adult pulmonary, endothelial cells, prostate cell 
line PC3, microvascular endothelial cells, fetal heart, and 
dendritic cells. Other sequences were detected in libraries from: 
35 multiple sclerosis lesions, breast, kidney, and germinal center B 
cells . 
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SEQ ID NO: 16, SEQ ID NO: 18. SEQ ID NO: 20 and SEQ ID NO- 22 
provide the sequences of various mammalian genes designated RANKL. 

Interesting features of the rodent RANKL include: signal 
sequence from about 1-29; tnf receptor Cys rich domain I (about 
33-74), II (about 75-114), and in (about 115-135). Interesting 
features of the primate RANKL include: TNF receptor Cys rich 
domain I (about 1-43), n (about 44-83), and III (about 84-104) - 
transmembrane segment from about 139-155. Alignment with other' 
TOP receptors will identify additional interesting corresponding 
features. Segments with boundaries at these positions may be 
especially interesting. 

Hybridization signals with RANKL were detected with rodent, 
e.g., mouse sequence, in CH12 (B cell line), rag-1 thymus, rag-1 
heart, rag-1 brain (strongest signal), rag-1 testes, rag-1 liver 
normal lung, rag-1 lung, asthmatic lung, tolerized and challenged 
lung, Nippo-infected lung, Nippo IL-4 K.O. lung, Nippo anti-IL-5 
lung, influenza lung, guinea pig allergic lung, w.t. stomach, and 
w.t. colon on a 3 day exposure at -80° C with a screen. On a 2 
week exposure at -80° C with screen, signals were also detected 
in the following libraries: Mel 14+ naive, Mell4 + Thl, Mell4 + Th2, 
Thl 3 week Bl/6, large B cell, bEnd3 + TNFa + IL-10, guinea pig 
normal lung, and Rag Hh- colon. Probes of human libraries with 
rodent sequence provided: detectable signals in Monkey asthma 
lung 4 h (1.6-2.0 kb) and adult placenta (2.5-3.0 kb) on a 3 day 
25 exposure at -80° C with screen. On a 2 week exposure at -80° C 
with screen, signals were also detected in the following 
libraries: CDla+ 95% DC activated. CHA (kidney epithelial 
carcinoma cell line) , monkey lung normal, psoriasis skin, fetal 
lung, fetal ovary, fetal testes, and fetal spleen. 
30 In another embodiment, the invention provides a chemokine. 

For a review of the chemokine family, see, e.g., Lodi, et al. 
45 (1994 » Silence 263:1762-1767; Gronenborn and Clore (1991) Protein 

Eng i neering 4:263-269; Miller and Kranger (1992) Proc. Marl &r- ? H 
Sci - UgA 89=2950-2954; Matsushima and Oppenheiin (1989) Cytokine 
35 1:2-13; Stoeckle and Baker (1990) New Biol 2:313-323; Oppenheim. 
et al. (1991) Anji, Rev. Immunol 9:617-648; Schall (1991) CvtokinP 
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3:165-183; and Thomson {ed. 1994) The Cvtnk-i.na Handbook 2d ed. 
Academic Press, NY. 

The new chemokine described herein is designated HCC5 which 
is a CC chemokine. See SEQ ID NO: 24 and SEQ ID NO: 25. The 
descriptions are directed, for exemplary purposes, to the human 
HCC5 natural allele described, but are likewise applicable to 
allelic and/or polymorphic variants, e.g., from other individuals, 
as well as splicing variants, e.g., natural forms. Based on 
sequence analysis of the chemokine protein sequences described 
below, it is apparent that HCC5 belongs tc the CC chemokine 
family. See, e.g., stem cell mobilizing chemokine (CKbeta-1) from 
Kreider, et al. (1997) Patent WO 9715594 (SEQ ID NO: 26) and 
GenBank Accession number 97P-W17659; macrophage inflammatory 
protein-1 -gamma (MIP-1) from Adams, et al . (1995) Patent WO 
9517092 (SEQ ID NO: 27) and GenBank Accession number 95P-R76128; 
human MIP-4, a chemoattractant for leukocytes from Adams, et al . 
(1997) Patent WO 9 634891 (SEQ ID NO: 28) and GenBank Accession 
number 96P-W07203; pituitary expressed chemokine (PGEC) from 
Bandman, et al., Patent WO 9616979 (SEQ ID NO: 29) and GenBank 
Accession number 96P-R95691; and human chemokine KCC-1 from 
Forsmann, et al. (1S98) Patent WO 9741230 (SEQ ID NO: 30) and 
GenBank Accession number 97P-W38171. 

The HCC5 chemokine was discovered through searches and 
careful analysis of database sequences. The HCC5 sequence was 
discovered in a cDNA library from pooled bulk breast tumor tissue. 
Absence of overlapping sequences from other sources suggests 
extremely specific tissue expression, or highly regulated 
expression. Amino acid homology analysis suggests that the HCC5 
gene encodes a member of a group of related family of chemokine s . 
The primate, e.g., human, HCC5 chemokine is most closely related 
in sequence to the chemokines, human chemokine HCC1; human 
pituitary expressed chemokine (PGEC); human MIP-4 (a 
chemoattractant for leukocytes) ; human macrophage inflammatory 
protein-1 -gamma (MlP-ly) ; and human stem cell mobilizing chemokine 
(CKbeta-1) . 

The HCC5 chemokine is seemingly specifically expressed, since 
its sequence has not appeared from many sources. The structural 
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similarity to other chemokines suggests that signals important in 
inflammation, cell differentiation, and development are mediated 
by it. 

It is possible that the HCC5 may actually be an antagonist of 
one, some, or all, of many related chemokines. In such case, 
combination compositions may be desired. For example, a combined 
group of functional agonists and antagonists for specific 
receptors may be called for, e.g., a combination of chemokines and 
antibody antagonists of others.. In addition, HCC5 may be useful 
to block HIV or HTLV infection, which viruses may use the 
respective receptors for infection. 

The HCC5 chemokine exhibits limited similarity to portions of 
known chemokines. See, e.g., Matsushima and Oppenheim (1989) 
Cyt ° kine 1:2 -«; Oppenheim, et al . (1991) Ann. Rev. jm™, ^ 
9:617-648; Schall (1991) Cytokine 3:165-183; and Gronenborn and 
Clore (1991) Protein Rpqinesri ng 4:263-269. Other features of 
comparison are apparent between the HCC5 chemokine and chemokine 
families. See, e.g., Lodi, et al. (1994) Scien^ 263:1762-1756. 
In particular, p-sheet and a-heiix residues can be determined 
using, e.g., RASMOL program, see Sayle and Milner-White (1995) 
TIBS 20:374-376; or Gronenberg, et al. (1991) Protein Fnoin^-i^ 
4:263-269; and other structural features are defined in Lodi, et 
al- (1994) Science 263:1762-1767. These secondary and tertiary 
features assist in defining further the C, CC, CXC, and CX3C 
structural features, along with spacing of appropriate cysteine 
residues . 

Antagonists might be created by N-terminal modification, 
e.g., either truncation of addition of an N-terminal methionine. 
Since HCC5 is structurally related to the HCC chemokines, it may 
well exhibit similar behaviors and functions. 

The distribution of the HCC 5 chemokines, especially in 
dendritic cells, or in Thl T cells, B cells, and macrophages, 
suggest roles in immune functions, e.g., it will likely attract T 
cells and monocytes.. Thus, the HCC 5 chemokine is likely to 
recruit these cell types in vivo, thereby enhancing the immune 
response mediated by these cell types. The expression patterns 
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appear consistent with a functional importance of the ligands in a 
TH1/TH2 regulation and/or response, including, e.g., in a cancer 
therapy. Thus, ligands and homologs are identified as possible 
immune adjuvants, e.g., for cellular responses, but also as 
5 possible adjuvants to modulate soluble antigen responses, e.g., 
vaccines . 

The invention further provides mammalian, e.g., primate, DNA 
sequences encoding proteins which exhibit structural properties of 
likely intracellular deubiquitinating protein enzymes. These 
10 proteins are designated deubiqutinating 11 (Dubll) and 

deubiqutinating 12 (Dubl2) . For a review of the superfamily of 
deubiquitinating enzymes see, e.g., Hochstrasser (1995) Curr. 
Opin. Cell Biol . 7:215-223; Wilkinson, et al . (1995) Biochemistry 
34:14535-14546; Baker, et al. (1992) J. Biol. C.hvm 267:23364- 
23375; and Papa and Hochstrasser (1993) Nature 366:313-319. 
However, the deubiquitinating enzymes have also been reported to 
have additional functions besides deubiquitination. See, e.g., 
Hochstrasser (1996) Cell 84:813-815; Hicke and Riezman (1996) Cell 
84:277-287; and Chen, et al . (1996) Cell 84:853-862. 

The descriptions typically are directed, for exemplary 
purposes, to the human Dubll and human Dubl2 natural alleles 
described, but are likewise applicable to allelic and/or 
polymorphic variants, e.g., from other individuals, as well as 
splicing variants, e.g., natural forms, and species variants from 
25 other primates or other species. These genes will allow isolation 
of other primate genes encoding proteins related to this, further 
extending the family beyond the specific embodiments described. 

The Dubll or Dubl2 proteins (naturally occurring or 
recombinant), fragments thereof, and antibodies thereto, along 
with compounds identified as having binding affinity to Dubll or 
Dubl2, may be useful in the treatment of conditions associated 
with, abnormal physiology or development, such as, e.g., uterine 
carcinoma associated with p53 dysregulation associated with human 
papilloma virus or mental retardation of Angelman syndrome (AS) 
35 due to disruption of the 5' end of the UBE3A (E6-AP) gene which 
codes for a disubiquitination protein. Pharmacological 
intervention which alters the half-lives of cellular proteins 
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associated with these diseases may have wide therapeutic 
potential. Specifically, prevention of p53 ubiquitination (and 
subsequent degradation) in human papilloma virus positive tumors, 
and perhaps all tumors retaining wild-type p53 but lacking the 
retinoblastoma gene function, could lead to programmed cell death. 
Additionally, specific inhibitors of p27 and cyclin B 
ubiquitination are predicted to be potent antiproliferative 
agents. Inhibitors of IkappaB ubiquitination should prevent 
NFkappaB activation and may have utility in a variety of 
autoimmune and inflammatory conditions. Finally, deubiquitination 
enzymes may be novel, potential drug targets as they also appear 
to regulate cell proliferation. These conditions or disease 
states may be modulated by appropriate therapeutic treatment using 
the deubiquitination compositions provided herein. 

Conversely, methods for blocking the enzymatic activities 
should have the opposite effects. Small molecule drug screening 
to block enzymatic activity of the protein can be performed to 
identify entities which will block the deubiquitination, thereby 
affecting protein degradation pathways, as appropriate. 

The T cell growth factor interleukin-2 (IL-2) regulates 
lymphocyte proliferation by inducing the expression of growth 
promoting genes. HTLV-1 transformed cell lines derived from Adult 
T-cell Leukemia (ATL) can exhibit constitutive activation of the 
IL-2-induced JAK/STAT pathway. See Migone, et al. (1998) Proc. 
gaf 1 Acad, sci , tt P a 95:3845-3850. ATL cell lines were examined 
for expression of IL-2 induced genes. It was found that the 
deubiquitinating enzyme Dub2 is constitutively expressed. See 
Zhu, et al. (1997) J, Biol. rhojy 272:51-57. Moreover, Dub2 
expression conferred cytokine-independent proliferation on the 
interleukin-3 -dependent murine Ba/F3 hematopoietic cell line. 
SCID mice (n = 18) subcutaneously injected with Ba/F3 cells 
expressing Dub2 , (but not a C to S inactive mutant of Dub2> 
developed tumors with a six week latency. Cells derived from 
these tumors exhibited constitutive tyrosine phosphorylation of 
STAT5 and also mimicked the ATL cell lines by exhibiting down- 
regulation of the protein tyrosine phosphatase SHP-1 . These 
findings strongly indicate that Dubl2 is an oncogene that, when 
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constitutively expressed, can induce cytokine-independent growth 
in lymphocytes and may be implicated in leukemogenesis . it is 
likely that Dub2 controls cell growth by regulating the ubiquitin- 
dependent proteolysis or the ubiquitin-dependent state of a 
critical intracellular substrate. Functional similarity of the 
Dubll and Dubl2 would be expected. Thus, the biological role of 
Dub2 suggests similar important roles for the other Dub family 
members . 

Screening for inhibitors of the DUB enzymes can also be 
easily accomplished using the known assays for activity. Such 
assays can be developed into high throughput screening efforts, 
testing, particularly, compounds known to affect protein turnover, 
or similar enzymatic sites. Small molecule antagonists of the 
enzymes, which will be membrane permeable, would be particularly 
15 desirable therapeutically. 

in the MD embodiments of the present invention, mammalian, 
e.g., primate, and rodent, e.g., mouse, DMA sequences are provided 
encoding proteins which exhibit structural properties of ligands 
for proteins exhibiting a leucine-rich protein motif (LRR) that is 
30 20 important, e.g., in immune function. These proteins are 

designated herein human MD-1, human MD-2, and murine MD-2 . The 
human MD-1 is a primate horaolog of the previously described rodent 
MD-1, see, e.g., Miyake, et al. (1998) J. Tmm.mnl 161:1348-1353, 
35 while human md-2 and mouse MD-2 are newly discovered MD-1 homolog. 

25 For a general review of LRR proteins, see, e.g., Kobe and 

Deisenhofer (1994) Trends Rio^ hem. sri 19:412. For the role of 
LRR in specific immune defenses, see, e.g., Jones, et al. (1994) 
40 Science 266:789; Dixon, et al . (1996) Cell 84:451; and Baker, et 

al. (1997) Science 276:726. 
30 Similar sequences for proteins in other species should also 

be available. The descriptions below are directed, for exemplary 
45 purposes, to the primate, e.g., human, MD-1 and MD-2, and rodent, 

e.g., mouse, MD-2 natural alleles described, but are likewise 
applicable to allelic and/or polymorphic variants, e.g., from 
35 other individuals, as well as splicing variants, e.g., natural 
50 forms , and species variants . 
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The MD-1 or MD-2 proteins (naturally occurring or 
recombinant), fragments thereof, and antibodies thereto, along 
with compounds identified as having binding affinity to MD-1 or 
MD-2, should be useful in the treatment of conditions associated 
with abnormal physiology or development, such as, e.g., the 
recognition of specific pathogenic molecules and the activation of 
B cell physiology. As indicated above, MD-1 and MD-2 exhibit 
structural motifs characteristic of ligands for the RP105 or BAS-1 
surface receptors. Thus, soluble forms, antibodies, or small 
molecule drugs which disrupt intercellular signaling mediated by 
these receptors, will find use in modulating cellular response. 
These responses will be important in normal or abnormal clinical 
situations . 

The matching of the MD and RP105 may also be easily tested. 
Identification of the counter receptor for the MD-2 may include 
testing both the RP105 and BAS-1 genes, along with other screening 
methods, as described. The likely counter receptor structure for 
the MDs are RP105, BAS-1, and related genes. Associated proteins 
which bind to these, including the DUB proteins, may be identified 
using these techniques, among others. 

Another aspect of the invention provides members of the 
cyclin proteins. The cyclins and their partner catalytic 
subunits, the cyclin-dependent kinases (Cdks) , play key roles in 
the regulation of eukaryotic cell cycle events. See, e.g., 
Draetta (1994) Curr. Qpjn. Cell Riol 6:842-846; Sherr (1994) Cell 
79:551-555; and Ohtsubo, et al. (1995) Mol. Cell. R-inl , 15:2612- 
2624. Cyclins were first identified in marine invertebrates on 
the basis of their dramatic cell cycle periodic expression during 
meiotic and mitotic divisions. 

A large family of cyclins, designated cyclins A-H, bind and 
activate different Cdks which are serine/threonine kinases 
essential for cell cycle progression. The timing of the 
expression of the various cyclins is key in determining at which 
phase of the cell cycle (S, Go, Gi, or G 2 ) their associated Cdk is 
active. D-type cyclins are synthesized early in Gi and bind and 
activate CDK4 and CDK6. Cyclin E-Cdk2 and Cyclin A-Cdk2 complexes 
form later in Gi as cells prepare to begin DNA synthesis. Cyclin 
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B-cdc2 is active during G 2 and mitosis. See, e.g.. Lees (1995) 
.Curr. Opin. Cell Binl 7:773-780. 

Other Cyclin-Cdk complex associated proteins are critical for 
modulation of cyclin activity. Proteins that co- 
5 immunoprecipitated with cyclin E were visualized by SDS-PAGE 

However, viability of the cyclin E "knockout" mouse, suggested the 
existence of redundancy. Moreover, work in other species also 
suggested that a homolog might exist in human. 

Cdks can also exert control on cell division and 
10 proliferation by phosphorylating specific intracellular target 
proteins. This phosphorylation event can induce the cellular 
transition from the Gi to the S phase of the cell cycle. See, 
e.g., Strahler, et al. (1992) Biochem. ^nohv S . m r^ r 
185:197-203; Brattsand, etal. (1994) Eur. J. H Wh m oon.^o 
15 368; and Li, et al. (1996) CjlLI 85:319-329. Regulation of the 
cell cycle machinery is important in development and control of 
cellular proliferation. Misregulation may lead to proliferative 
disorders, e.g., neoplastic conditions and cancer. See, e.g., 
Sherr (1998) Science 274:1672-1677. 
0 The novel cyclin gene, designated cyclin E2, exhibits about 

49% structural identity to the known human cyclin E. See, e g 
Lew, et al. (1991) Cell 66:1197-1206; and NCBI Entrez accession' 
number M74093. The new variant cyclin E2 sequences are orovided 
in SEQ ID NO: 52 and SEQ ID NO: 53. Notable features on the E2 
sequence include the cyclin box running from about residue 120- 
254; and a putative phosphorylation site at thr392 . The 
phosphorylation site is trigger in cyclin E for ubiquitin 
dependent degradation. See Clurman, et al. (1996) Genes nnd 
Development 10:1979-1990. Particularly interesting segments 
include, e.g., from about 93-100; 98-106; 104-113; 107-121; 120- 
128; 124-134; 131-137; 172-177; 175-185; 189-193; 196-202 ; 200- 
210; 218-223; 228-232; 236-242; 240-245; and 248-252. 

The structural homology of these genes to identified families 
suggests related function of these molecules. For example, PGT 
homologs should function in transport across cell membranes; TNF 
receptor family antagonists, or agonists, may act as a co- 
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stimulatory molecule for regulation of T cell mediated cell 
activation, and may in fact, cause a shift of T helper cell types 
e.g., between Thi and Th2; chemokines have recognized functional ' 
properties; intracellular Dubs have been described and the role in 
oncogenesis established; membrane associated or soluble forms of 
signaling proteins such as the MDs are well known; and the role of 
cyclins in cell cycle regulation are recognized. Alternatively, 
the ligands or binding structures for the cell surface antigens 
may serve to regulate cell proliferation or development. 

For the TNF ligand molecules, they typically modulate cell 
proliferation, viability, and differentiation. For example TNF 
and FAS can kill cells expressing their respective receptors 
including fibroblasts, liver cells, and lymphocytes. Some members 
of thxs class of ligands exhibit effects on cellular proliferation 
of cells expressing their respective receptors, e.g., B cells 
25 expressing CD40. These effects on proliferation may also effect 

subsequent differentiation steps, and may lead, directly or 
indirectly, to changes in cytokine expression profiles. 
The members of the TNF ligand family also exhibit 
20 costimulation effects, which may also regulate cellular 

differentiation or apoptosis. Receptor expressing cells may be 
protected from activation induced cell death (AICD) or apoptosis. 
For example, CD40 ligand can have effects on T and B lymphocytes. 
The embodiments characterized herein are mostly from human, 
25 but additional sequences for proteins in other mammalian species, 
e.g., primates and rodents, will also be available. See below. 
In particular, with the polypeptide sequences provided, reverse 
translation, e.g., using the genetic code, software is available, 
which will indicate what nucleic acid sequences could encode them. 
30 See, e.g., MacVector, Oxford Molecular Group Software. Thus, 

artificial genes, or redundant oligonucleotides may be selected to 
isolate natural variants or species counterparts. 



15 



II. Purified Protein 
35 Primate, e.g., human, DC-PGT polypeptide sequence is shown in 

SEQ ID NO: 2; primate, e.g., human, HDTKA84 polypeptide sequence 
is shown in SEQ ID NO: 6; primate, e.g., human, HSLJD37R 
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polypeptide sequences are shown in SEQ ID NO: 8, 10, and 12; 
rodent, e.g., murine, RANKL polypeptide sequence is shown in SEQ 
ID NO: 17; primate forms of RANKL polypeptide sequence are shown 
in SEQ ID NO: 19, 21, and 23; primate, e.g., human, HCC5 chemokine 
polypeptide sequence is shown in SEQ ID NO: 25; primate, e.g., 
human, Dubll polypeptide sequences are shown in SEQ ID NO: 32 and 
34; primate, e.g., human. Dubl2 polypeptide sequences are shown in 
SEQ ID NO: 3 6 and 38; primate, e.g., human, MD-1 polypeptide 
sequence is shown in SEQ ID NO: 42; primate, e.g.. human, MD-2 
polypeptide sequence is shown in SEQ ID NO: 44 and 46; rodent, 
e.g., mouse, MD-2 polypeptide sequences are shown in SEQ ID NO: 48 
and 49; and primate, e.g., human, cyclin E2 is shown in SEQ ID NO: 
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These amino acid sequences, provided amino to carboxy, are 
important in providing sequence information in the antigen 
allowing for distinguishing the protein from other proteins and 
exemplifying numerous variants. Moreover, the peptide sequences 
allow preparation of peptides to generate antibodies to recognize 
such segments, and nucleotide sequences allow preparation of 
oligonucleotide probes, both of which are strategies for detection 
or isolation, e.g., cloning, of genes or cDNAs encoding such 
sequences . 

The purified protein, or proteins will typically comprise a 
35 plurality of distinct, e.g., nonoverlappir.g, segments of the 

25 specified length. Typically, the plurality will be at least two, 
more usually at least three, and preferably 5, 7, or even more. 
While the length minima are provided, longer lengths, of various 
sizes, may be appropriate, e.g., one of length 7, and two of 
length 12. Such peptides are useful for generating antibodies by 
standard methods, as described herein. Synthetic peptides or 
purified protein can be presented to an immune system to generate 
45 a specific binding composition, e.g., monoclonal or polyclonal 

antibodies. See, e.g., Coligan (1991) Current Protocols in 
Immunol oqy Wiley /Greene; and Harlow and Lane (Current ed.) 
35 Antibodies; A, Laboratory Ma n na . 1 Cold Spring Harbor Press. 
50 For example, the specific binding composition could be used 

for screening of an expression library made from a cell line which 
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expresses a clone encoding, e.g., a prostaglandin transporter 
The screening can be standard staining of surface expressed 
protein, or by panning. Screening of intracellular expression can 
also be performed by various staining or immunofluorescence 
procedures. The binding compositions could be used to affinity 
purify or sort out cells expressing the protein. The binding 
compositions may also be useful in determining qualitative and 
quantitative expression levels of the proteins in various 
biological samples, including, e.g., cell types or tissues. 

As used herein, the term, e.g., -human DC-PGT". shall 
encompass, when used in a protein context, a protein having amino 
20 aCld se <^ ence shown in SEQ ID NO: 2. Significant polypeptide 

fragments of such a protein should preserve some of the 
properties, biological or physical, of the full length protein 
15 Other essentially identical or equivalent proteins may be found in 
25 other Penates or related species. In addition, binding 

components, e.g., antibodies, typically bind to, e.g., a DC-PGT 
with high affinity, e.g., at least about 100 nM, usually better' 
than about 30 nM, preferably better than about 10 nM, and more 
preferably at better than about 3 nM. Homologous proteins would 
be found m mammalian species other than human, e.g., primates or 
rodents. Non-mammalian species should also possess structurally 
or functionally related genes and proteins, e.g., birds or 
amphibians . Similar meanings apply in reference to HDTEA84 
25 HSLJD37R, RANKL, HCC5, Dubll, Dubl2, MD-1, md- 2 , Md C yclinE2. 

The term polypeptide, as used herein, includes a significant 
fragment or segment, and encompasses a stretch of amino acid 
residues of at least about 8 amino acids, generally at least 10 
amino acids, more generally at least 12 amino acids, often at 
least 14 amino acids, more often at least 16 amino acids, 
typically at least 18 amino acids, more typically at least 20 
amino acids, usually at least 22 amino acids, more usually at 
least 24 amino acids, preferably at least 26 amino acids, more 
preferably at least 28 amino acids, and, in particularly preferred 
35 embodiments, at least about 30 or more amino acids. The segments 
may have lengths of at least 37, 45. 53. 61, 70, 80, 90, etc., and 
often will encompass a plurality of such matching sequences. The 
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specific ends of such a segment will be at any combinations within 
the protein. In certain embodiments, there will be a plurality of 
distinct, e.g., nonoverlapping , segments of the specified length. 
Typically, the plurality will be at least two, more usually at 
least three, and preferably 5, 7, or even more. While the length 
minima are provided, longer lengths, of various sizes, may be 
appropriate, e.g., one of length 7, and two of length 12. 

The term "binding composition' refers to molecules that bind 
with specificity to the respective protein or polypeptide, e.g., 
DC-PGT, e.g., in a cell adhesion pairing type fashion, or an 
antibody-antigen interaction. Other compounds include, e.g., 
proteins, which specifically associate with DC-PGT, including in a 
natural physiologically relevant procein-protein interaction, 
either covalent or non-covalent. The molecule may be a polymer, 
or chemical reagent. A functional analog may be an antigen with 
25 structural modifications, or it may be a molecule which has a 

molecular shape which interacts with the appropriate binding 
determinants. The compounds may serve as agonists or antagonists 
of the binding interaction, see, e.g., Goodman, et al. (eds. 1990) 
Goodman & Oilman's; Thfi Pharmacol nrri^i B ase* of ^ raDmHp . 
(8th ed. ) Pergamon Press. 

Substantially pure, in the polypeptide context, typically 
means that the protein is free from other contaminating proteins, 
35 nucleic acids, and other biologicals derived from the original 

source organism or cell. Purity may be assayed by standard 
methods, and will ordinarily be at least about 40% pure, more 
ordinarily at least about 50% pure, generally at least about 60% 
pure, more generally at least about 70% pure, often at least about 
75% pure, more often at least about 80% pure, typically at least 
about 85% pure, more typically at least about 90% pure, preferably 
at least about 95% pure, more preferably at least about 98% pure, 
and in most preferred embodiments, at least 99% pure. The 
analysis may be weight or molar percentages, evaluated, e.g., by 
gel staining, spectrophotometry, or terminus labeling. Carriers 
35 or excipients will often be subsequently added. 

Solubility of a polypeptide or fragment depends upon the 
environment and the polypeptide. Many parameters affect 
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polypeptide solubility, including temperature, electrolyte 
environment, size and molecular characteristics of the 
polypeptide, and nature of the solvent. Typically, the 
temperature at which the polypeptide is used ranges from about 4° 
C to about 65" C. Usually the temperature at use is greater than 
about 18° C. For diagnostic purposes, the temperature will 
usually be about room temperature or warmer, but less than the 
denaturation temperature of components in the assay. For 
therapeutic purposes, the temperature will usually be body 
temperature, typically about 37° C for humans and mice, though 
under certain situations the temperature may be raised or lowered 
20 in situ or in vitro. 

The size and structure of the polypeptide should generally be 
in a substantially stable state, and usually not in a denatured 
15 state. The polypeptide may be associated with other polypeptides 
25 in a quaternary structure, e.g., to confer solubility, or 

associated with lipids or detergents in a manner which 
approximates natural lipid bilayer interactions. 

The solvent and electrolytes will usually be a biologically 
compatible buffer, of a type used for preservation of biological 
activities, and will usually approximate a physiological aqueous 
solvent. Usually the solvent will have a neutral pH, typically 
between about 5 and 10, and preferably about 7.5. On some 
occasions, one or more detergents will be added, typically a mild 
25 non-denaturing one, e.g., CHS (cholesteryl hemisuccinate) or CHAPS 
(3-[3-cholamidopropyl)dimethylammonio]-l-propane sulfonate), or a 
low enough concentration as to avoid significant disruption of 
structural or physiological properties of the protein. 

Solubility is reflected by sedimentation measured in Svedberg 
units, which are a measure of the sedimentation velocity of a 
molecule under particular conditions. The determination of the 
sedimentation velocity was classically performed in an analytical 
ultracentrifuge, but is typically now performed in a standard 
ultracentrifuge. See, Freif elder (1982) Physical Rio chPTi-i gi-T-y (2 d 
35 ed.), W.H. Freeman; and Cantor and Schimmel (1980) Biophvgir^i 

Chemistry , parts 1-3, W.H. Freeman & Co., San Francisco; each of 
which is hereby incorporated herein by reference. As a crude 
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determination, a sample containing a putatively soluble 
polypeptide is spun in a standard full sized ultracentrifuge at 
about 50K rpm for about 10 minutes, and soluble molecules will 
remain in the supernatant. A soluble particle or polypeptide will 
typically be less than about 3 OS, more typically less than about 
15S, usually less than about 10S, more usually less than about 6S, 
and, in particular embodiments, preferably less than about 4S, and 
more preferably less than about 3S. 

The human complimentary DNA and deduced amino acid sequence 
provided here for DC-PGT contains sequences corresponding to 
twelve putative transmembrane (TM) segments, based upon a 
hydropathicity and structural analysis of DC-PGT. A TopPredll 
{Claros and von Heijne (1994) Como. amH e . R-in^i 10:685-686) 
profile of the dc-pgt sequence showing peaks that reach beyond 
15 -putative' or 'certain- baselines. The 12 transmembrane segments 
correspond to hydrophobic stretches which run approximately from 
amino acids 47-68 (TM1) ; 88-107 (TM2); 117-136 { TM3 ) ; 188-208 
(TM4) ; 225-244 (TM5) ; 279-294 (TM6) 367-386 (TM7 ) ; 412-431 (TM8) • 
450-474 (TM9 ) ; 561-578 (TM10); 597-616 (TM11) ; and 651-675 (TM12)' . 
30 20 Charged amino residues located within the transmembrane domains 

are: glutamine at amino residues 59, 62, 196, 207, 380, 469, 602, 
655, and 675; glutamic acid at residue 95; and arginine at 
residues 607 and 674. Extracellular loops correspond 
35 approximately to amino acid residues 69-87, 137-187, 295-366, 432- 

25 449, 579-596, and 617-650. Putative N-glycosylation sites in the 
exposed, extracellular face of the molecule are located in the 
second and fifth extracellular loops at Asn-X-Ser/Thr motifs 
„ (e.g., 146-148; 176-178; and 538-540). Intracellular portions 

correspond approximately to amino acid residues 1-46, 108-116, 
30 209-224, 295-366, 432-449, 579-596, and 676-709. These boundaries 
will often be the boundaries of segments of interest, which be 
include multiple described segments. 

Transporters of this family are typically 12 transmembrane 
proteins of approximately 650 amino acids in length and include 
35 the organic anion transporters in man and rat, prostaglandin 

transporters of man (Lu, et al. (1996) J. Clin. Tn^f , 98:1142- 
1149) and rat; brain digoxyin transporters and Matrin F/G of rat 
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(Kanai. et al . (1995) Science 268:866-869). Characteristic of 
this family of organic anion transporter proteins is a cysteine 
rich region located in one of the extracellular loops, which 
resembles a zinc finger motif. The DC-PGT cysteine rich reoion is 
located in extracellular loop 5 with cysteines approximately at 
positions 489, 493, 495, 504, 516, 520, 539, 541. 554, and 557 

Other particularly interesting segments of the TNF receptors 
Dubs, MDs. and cyclin E are pointed out. These may also be 
segments of comparison with other proteins or genes. 

III. Physical Variants 

This invention also encompasses proteins or peptides having 
substantial amino acid sequence homology with the amino acid 
sequences of the described proteins. The variants include species 
and polymorphic variants, e.g., naturally occurring forms. 

Amino acid sequence homology, or sequence identity, is 
determined by optimizing residue matches, if necessary, by 
introducing gaps as required. See also Needleham, et al. (1970) 
J ' MP1 " Rin1 48 =443-453; Sankoff, et al . (1983) Chapter One in 
Time Warps, String Frtifs, and Mar-rom ol „ B ■ th, t^^ , ^ 
Practice of Sequence rotnp g ri rnn< Addison _ Wesley/ Reading> ^ and 
software packages from IntelliGenetics, Mountain View, CA; and the 
University of Wisconsin Genetics Computer Group, Madison, WI. 
Sequence identity changes when considering conservative 
substitutions as matches. Conservative substitutions typically 
include substitutions within the following groups: glycine, 
alanine; valine, isoleucine, leucine; aspartic acid, glutamic 
acid; asparagine, glutamine; serine, threonine; lysine, arginine- 
and phenylalanine, tyrosine. Homologous amino acid sequences are 
typically intended to include natural polymorphic or allelic and 
interspecies variations in each respective protein sequence. 
Typical homologous proteins or peptides will have from 25-100% 
identity (if gaps can be introduced) , to 50-100% identity (if 
conservative substitutions are included) with the amino acid 
sequence of the HDTEA84 . Identity measures will be at least about 
35%, generally at least about 40%, often at least about 50%, 
typically at least about 60%, usually at least about 70%, 
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For sequence comparison, typically one sequence acts as a 
reference sequence, to which test sequences are compared. When 
using a sequence comparison algorithm, test and reference 
sequences are input into a computer, subsequence coordinates are 
designated, if necessary, and sequence algorithm program 
parameters are designated. The sequence comparison algorithm then 
calculates the percent sequence identity for the test sequence (s) 
relative to the reference sequence, based on the designated 
program parameters. 

Optical alignment of sequences for comparison can be 
conducted, e.g., by the local homology algorithm of Smith and 
Waterman (1981) Adv. A Pp3 M afh 2:482, by the homology alignment 
algorithm of Needleman and Wunsch (1970) J. Mol . Bim 48:443, by 
the search for similarity method of Pearson and Lipman (1988) 
Proc. Mafl Acad. .Sci , u,^ 85:2444, by computerized 
implementations of these algorithms (GAP, BESTFIT, PASTA, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Dr., Madison, WI) , or by visual 
inspection (see generally Ausubel et al., supra). 

One example of a useful algorithm is PILEUP. PILEUP creates 
a multiple sequence alignment from a group of related sequences 
using progressive, pairwise alignments to show relationship and 
percent sequence identity, it also plots a tree or dendrogram 
showing the clustering relationships used to create the alignment. 
PILEUP uses a simplification of the progressive alignment method 
of Feng and Doolittle (1987) J. MoT , Rvnl , 35:351-360. The method 
used is similar to the method described by Higgins and Sharp 
(1989) CABIQS 5:151-153. The program can align up to 300 
sequences, each of a maximum length of 5,000 nucleotides or amino 
acids. The multiple alignment procedure begins with the pairwise 
alignment of the two most similar sequences, producing a cluster 
of two aligned sequences. This cluster is then aligned to the 
next most related sequence or cluster of aligned sequences. Two 
clusters of sequences are aligned by a simple extension of the 
pairwise alignment of two individual sequences. The final 
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alignment is achieved by a series of progressive, pairwise 
alignments. The program is run by designating specific sequences 
and their amino acid or nucleotide coordinates for regions of 
10 sequence comparison and by designating the program parameters 

5 For example, a reference sequence can be compared to other test 
sequences to determine the percent sequence identity relationship 
using the following parameters: default gap weight (3.00), default 
15 gap l6n Sth weight (0.10), and weighted end gaps. 

Another example of algorithm that is suitable for determining 
percent sequence identity and sequence similarity is the BLAST 
algorithm, which is described Altschul, et al . (1990) j, M o1 
Bi2l^ 215:403-410. Software for performing BLAST analyses is 
publicly available through the National Center for Biotechnology 
Information (http:www.ncbi.nlm.nih.gov/). This algorithm involves 
first identifying high scoring sequence pairs (HSPs) by 
identifying short words of length w in the query sequence, which 
either match or satisfy some positive-valued threshold score T 
when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score 
30 20 threshold (Altschul, et al . , supra). These initial neighborhood 

word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are then extended in both 
directions along each sequence for as far as the cumulative 
alignment score can be increased. Extension of the word hits in 
each direction are halted when: the cumulative alignment score 
falls off by the quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W. T, 
and X determine the sensitivity and speed of the alignment. The 
BLAST program uses as defaults a wordlength <w) of 11, the 
45 BLOSOM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. 

Mdf 1 Acad . , Sr i , TTS* 89:10915) alignments (B) of 50. expectation 
(E) of 10. M=5, N=4, and a comparison of both strands. 

In addition to calculating percent sequence identity, the 
BLAST algorithm also performs a statistical analysis of the 
similarity between two sequences (see, e.g., Karlin and Altschul 
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(1 " 3> mt '' ] ftr " d ' Pri. USA 90:5873-5787). One measure of 

similarity provided by the blast algorithm is the smallest sum 
probability (P(N,) / „ hich provides an indication of the 
probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a nucleic acid is 
considered similar to a reference sequence if the smallest sum 
probability in a comparison of the test nucleic acid to the 
reference nucleic acid is less than about 0.1, more preferably 
less than about 0.01, and most preferably less than about 0.001 

A further indication that two nucleic acid sequences of 
polypeptides are substantially identical is that the polypeptide 
encoded by the first nucleic acid is immunologically cross 
reactive with the polypeptide encoded by the second nucleic acid 
as described below. Thus, a polypeptide is typically 
substantially identical to a second polypeptide, for example, 
where the two peptides differ only by conservative substitutions 
Another indication that two nucleic acid sequences are 
substantially identical is that the two molecules hybridize to 
each other under stringent conditions, as described below. 

The isolated DC-PGT, HDTEA84 , HSLJD37R, RANKL, HCC5, Dub MD- 
1, MD-2, or cyclin E2 DNAs can be readily modified by nucleotide 
substitutions, nucleotide deletions, nucleotide insertions, and 
inversions of nucleotide stretches. These modifications result in 
novel DNA sequences which encode these antigens, their 
derivatives, or proteins having similar physiological, 
immunogenic, antigenic, or other functional activity.' These 
modified sequences can be used to produce mutant antigens or to 
enhance expression. Enhanced expression may involve gene 
amplification, increased transcription, increased translation, and 
D other mechanisms. For example, "Mutant HDTEA84 " encompasses a 
polypeptide otherwise falling within the sequence identity 
definition of the HDTEA84 as set forth above, but having an amino 
acid sequence which differs from that of HDTEA84 as normally found 
in nature, whether by way of deletion, substitution, or insertion. 
This generally includes proteins having significant identity with 
a protein having sequence of SEQ ID NO: 6, and as sharing various 
biological activities, e.g., antigenic or immunogenic, with those 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



48 



20 

15 

25 



30 20 



sequences, and in preferred embodiments contain most of the full 
length disclosed sequences. Full length sequences will typically 
be preferred, though truncated versions, e.g., soluble constructs 
10 *** intact d °*^ S , will also be useful, likewise, genes or 

5 proteins found from natural sources are typically most desired. 

Similar concepts apply to different HDTEA84 proteins, particularly 
those found in various warm blooded animals, e.g., mammals and 
15 b±rds ' ° r fish ' These descriptions are generally meant to 

encompass all HDTEA84 proteins, not limited to the particular 
10 human embodiment specifically discussed. Similar concepts apply 
to the other polypeptides provided. 

DC-PGT, HDTEA84 , HSLJD37R, RANK1, HCC5 , Dub. MD-1, MD-2 , or 
cyclin E2 mutagenesis can also be conducted by making amino acid 
insertions or deletions. Although site specific mutation sites 
are predetermined, mutants need not be site specific. Protein 
mutagenesis can be conducted by making amino acid insertions or 
deletions, or combinations may be generated to arrive at a final 
construct. Insertions include amino- or carboxy- terminal 
fusions. Random mutagenesis can be conducted at a target codon 
and the expressed mutants can then be screened for the desired 
activity. Methods for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known 
in the art. e.g., by M13 primer mutagenesis or polymerase chain 
35 reaction (PCR) techniques. See, e.g., Sambrook, etal. (1989); 

25 Ausubel, et al. (1987 and Supplements); and Kunkel, et al. (1987) 
Methods in Enzvraol 154:367-382. 

The mutations in the DNA normally should not place coding 
sequences out of reading frames and preferably will not create 
complementary regions that could hybridize to produce secondary 
mRNA structure such as loops or hairpins. 

The present invention also provides recombinant proteins, 
45 e - g -' heterologous fusion proteins using segments from these 

proteins. A heterologous fusion protein is a fusion of proteins 
or segments which are naturally not normally fused in the same 
manner. Thus, the fusion product of an immunoglobulin with a 
polypeptide is a continuous protein molecule having sequences 
fused in a typical peptide linkage, typically made as a single 
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translation product and exhibiting properties derived from each 
source peptide. A similar concept applies to heterologous nucleic 
acid sequences . 

The present invention also provides recombinant proteins, 
e.g., heterologous fusion proteins using segments from these 
proteins. A heterologous fusion protein is a fusion of proteins 
or segments which are naturally not normally fused in the same 
manner. A similar concept applies to heterologous nucleic acid 
sequences. Fusion proteins will be useful as sources for 
cleaving, separating, and purifying portions thereof. 

In addition, new constructs may be made from combining 
similar functional domains from other proteins. For example, 
target-binding or other segments may be "swapped" between 
different new fusion polypeptides or fragments. See, e.g., 
15 Cunningham, et al . (1989) Science 243:1330-1336; and C-Dowd, et 
25 al. (1988) J, Biol, rh^m 263:15985-15992. 

The phosphoramidite method described by Beaucage and 
Carruthers (1981) Tetra . i« rr * 22:1859-1862, will produce 
suitable synthetic DMA fragments. A double stranded fragment will 
often be obtained either by synthesizing the complementary strand 
and annealing the strand together under appropriate conditions or 
by adding the complementary strand using DNA polymerase with an 
appropriate primer sequence, e.g., PCR techniques. 

35 

25 IV. Functional Variants 

The blocking of physiological response with, e.g., HDTEA84 
HSLJD37R, RAKKL, HCC5 chemokine, md-1, or MD-2, may result from' 
the inhibition of binding of the respective ligand to signaling 
form of receptor or binding counterstructure, e.g., through 
30 competitive inhibition, in others, binding affinity to substrate 
may be modifiable or competed with, e.g., dc-PGT, Dubs, or cyclin 
E2. Thus, in vitro assays of the present invention will often use 
isolated protein, soluble fragments comprising ligand or substrate 
binding segments of these proteins, or forms attached to solid 
35 phase substrates. These assays will also allow for the diagnostic 
determination of the effects of either binding segment mutations 
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and modifications, or antigen mutations and modifications e g 
HDTEA84, HSLJD37R, RANKL, MD-1, or MD-2 analogs. 

This invention also contemplates the use of competitive drug 
screening assays, e.g., where neutralizing antibodies to antigen 
5 or banding fragments compete with a test compound for binding to 
the protein, e.g., of natural protein sequence. This is 
applicable to substrate binding, e.g.. competitive inhibitors, and 
in receptor interaction, where the protein has a binding 
counters true ture . 

10 "Derivatives- of , e.g.. receptor, antigens include amino 

acid sequence mutants from naturally occurring forms, 
glycosylate variants, and covalent or aggregate conjugates with 
other chemical moieties. Covalent derivatives can be prepared by 
linkage of functionalities to groups which are found in receptor 
15 amino acid side chains or at the N- or C- termini, e .g by 
™ standard means. See, e.g., Lundblad and Noyes (1988) Chemical 

Reaerents for Prnt-^-in ^ * j „ n vols 1 _ 2< CRC ^ ^ 

Boca Raton, FL; Hugli (ed. 1989) Technic. in x^ ein ' nbATn / ^ 
Academic Press, San Diego, CA; and Wong (1991) Chemist nf 
Protein Conjugation and Props Linking , CRC Press, Boca Raton, FL. 

In particular, glycosylation alterations are included, e g 
made by modifying the glycosylation patterns of a polypeptide 
during its synthesis and processing, or in further processing 
steps. See, e.g., Elbein (1987) Ann. Rev. R Wh«n ^.Aan c?1 
Also embraced are versions of the peptides with the same primary 
ammo acid sequence which have other minor modifications, 
including phosphorylated amino acid residues, e.g., 
phosphotyrosine, phosphoserine, or phosphothreonine . 

Fusion polypeptides between these proteins and other 
homologous or heterologous proteins are also provided. Many 
cytokine receptors or other surface proteins are multimeric, e.g., 
homodimeric entities, and a repeat construct may have various 
advantages, including lessened susceptibility to proteolytic 
cleavage. Typical examples are fusions of a reporter polypeptide. 
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35 e.g., luciferase, with a segment or domain of a protein, e.g., a 
receptor-binding segment, so that the presence or location of ' the 
fused ligand may be easily determined. See, e.g.. Dull, et al., 
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U.S. Patent No. 4,859,609. Other gene fusion partners include 
bacterial B-galactosidase. trpE, Protein A, S-lactamase, alpha 
amylase, alcohol dehydrogenase, yeast alpha mating factor, and 
detection or purification tags such as a FLAG sequence of His6 
sequence. See, e.g., Godowski. et al . (1988) Sclent 241:812-816. 
Of particular interest are fusion constructs of receptor with a 
membrane attachment domain. 

Fusion peptides will typically be made by either recombinant 
nucleic acid methods or by synthetic polypeptide methods. 
Techniques for nucleic acid manipulation and expression are 
described generally, e.g., in sambrook, et al. (1989) MoJ^cular 
20 Clon i ng; A Laboratory Manual (2d ed. ) , vols. 1-3, Cold Spring 

Harbor Laboratory; and Ausubel, et al. (eds. 1993) Current 
P rotocols in Molecular Biology , Greene and Wiley, NY. Techniques 
for synthesis of polypeptides are described, e.g., in Merrifield 
(1963) J - flmer - rbpm w - 85:2149-2156; Merrifield (1986) Science 
232: 341-347; Atherton, et al . (1989) Solid Pha^ Ppp ^jH. 
Synthesis: A Practical Ap proach, IRL Press. Oxford; and Grant 
(1992> Svnther.jr- Peptide- ft Ussr' s ran Ho W . H . Freeman, NY. 

This invention also contemplates the use of derivatives of 
the proteins other than variations in amino acid sequence or 
glycosylation. Such derivatives may involve covalent or 
35 aggregative association with chemical moieties . Covalent or 

aggregative derivatives will be useful as immunogens , as reagents 
25 in immunoassays, or in purification methods such as for affinity 
purification of binding partners, e.g., other antigens. The 
desired proteins can be immobilized by covalent bonding to a solid 
support such as cyanogen bromide-activated SEPHAROSE, by methods 
which are well known in the art, or adsorbed onto polyolefin 
surfaces, with or without glutaraldehyde cross-linking, for use in 
the assay or purification of antibodies or an alternative binding 
composition. The protein can also be labeled with a detectable 
group, e.g., for use in diagnostic assays. Purification may be 
effected by an immobilized antibody or complementary binding 
^ 35 partner. Conversely, immunoabsorption or immunodepletion 

techniques may be developed. 
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A solubilized protein or fragment of this invention can be 
used as an immunogen for the production of antisera or antibodies 
specific for binding to the antigen or fragments thereof. 
Purified antigen can be used to screen monoclonal antibodies or 
antigen-binding fragments, encompassing antigen binding fragments 
of natural antibodies, e.g.. Fab, Fab", F(ab) 2 , etc. Purified 
protein can also be used as a reagent to detect antibodies 
generated in response to the presence of elevated levels of the 
antigen or cell fragments containing the antigen, both of which 
may be diagnostic of an abnormal or specific physiological or 
disease condition. This invention contemplates antibodies raised 
against amino acid sequences encoded by nucleotide sequences 
described, or fragments of proteins containing it. in particular, 
this invention contemplates antibodies having binding affinity to' 
15 or being raised against specific fragments, e.g., which are 

predicted to lie outside of zhe lipid bilayer, both extracellular 
or intracellular. 

The present invention contemplates the isolation of 
additional closely related species variants. Southern and 
Northern blot analysis should establish that similar genetic 
entities exist in other mammals. it is likely that these proteins 
are widespread in species variants, e.g., rodents, lagomorphs, 
carnivores, artiodactyla, perissodactyla, and primates. 

The invention also provides means to isolate a group of 
related antigens displaying both distinctness and similarities in 
structure, expression, and function. Elucidation of many of the 
physiological effects of the molecules will be greatly accelerated 
by the isolation and characterization of additional distinct 
species variants of them. In particular, the present invention 
30 provides useful probes for identifying additional homologous 
genetic entities in different species. 
45 The isolated genes will allow transformation of cells lacking 

expression of a corresponding protein, e.g., either species types 
or cells which lack corresponding antigens and exhibit negative 
35 background activity. This should allow analysis of the function 
of genes in comparison co untransf ormed control cells. 
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Dissection of critical structural elements which effect the 
various activation or differentiation functions mediated through 
these antigens is possible using standard techniques of modern 
molecular biology, particularly in comparing members of the 
related class. See, e.g., the homolog-scannina mutagenesis 
technique described in Cunningham, et al . (1989) Science 243:1339- 
1336; and approaches used in O'Dowd, et al. (1988) J. Biol, rh^ 
263:15985-15992; and Lechleiter. et al . (1990) EMBO .T. 9:4381- 
4390. 

The invention also provides, in the context of the DC-PGT 
means to isolate a group of related organic anion transporters' 
e.g., other vertebrate prostaglandin transporters, displaying both 
distinctness and similarities in structure, expression, and 
function. Elucidation of many of the physiological effects of the 
15 antigens will be greatly accelerated by the isolation and 

characterization of distinct species variants. m particular, the 
present invention provides useful probes for identifying 
additional homologous genetic entities in different species The 
results described above indicate that sufficiently homologous 
genes exist in other species that cross-species hybridization is 
likely to allow successful cloning. 

The isolated genes will allow transformation of cells lacking 
expression of a described gene, e.g., prostaglandin transporter, 
various species types or cells which lack corresponding proteins 
can be isolated, and should exhibit negative background activity. 
Expression of transformed genes will allow isolation of 
antigenically pure cell lines, with defined or single specie 
variants. This approach will allow for more sensitive detection 
and discrimination of the physiological effects of the gene, e.g., 
prostaglandin transporters. Subcellular fragments, e.g., 
cytoplasts or membrane fragments, can be isolated and used. 

The DC-PGT genes may also be useful to increase the rate of 
transport of desired prostaglandins into transformed cells. Thus, 
the transporter may be transformed into cells for targeting of 
incorporation of desired substrates or analogs. For instance, it 
may be useful to incorporate specific modified prostaglandins ' into 
those cells, which may become more susceptible to other 
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treatments, or directly affected. Thus, specific dendritic cell 
subsets may be transformed to become more sensitive to 
prostaglandins or specific substrates. Conversely, such cells may 
be useful screening targets to identify entities which can block 
transport, thereby preventing uptake of substrate. 

Structural studies of the transporter will lead to design of 
new variants, particularly analogs exhibiting modified binding 
affanity, or perhaps, altered rate of transporter activity This 
can be combined with previously described screening methods to 
isolate variants exhibiting desired spectra of activities 
Alternatively, many different prostaglandins and analogs thereof 
may be screened for either transporter binding affinity or 
transporter transfer. The transporter may require a direct energy 
source, e.g., ATP or other nucleotide triphosphate, or may depend 
15 upon an ion gradient, as described above. 

In the context of the Dubs and cyclin E2. intracellular 
functions would probably involve segments of the antigen which are 
normally accessible to the cytosol, as would segments of the 
receptors. However, protein internalization may occur under 
30 20 certain circumstances, and interaction between intracellular 

components and "extracellular" components may occur. The specific 
segments of interaction of protein with other intracellular 
components may be identified by mutagenesis or direct biochemical 
35 means, e.g., cross-linking or affinity methods . 

Structural analysis by crystallographic or other physical 
methods will also be applicable. Further investigation of the 
mechanism of signal transduction will include study of associated 
components which may be isolatable by affinity methods or by 
genetic means, e.g., complementation analysis of mutants. 

Further study of the expression and control of the proteins 
will be pursued. The controlling elements associated with the 
45 antigens should exhibit differential physiological, developmental, 

tissue specific, or other expression patterns. Upstream or 
downstream genetic regions, e.g., control elements, are of 
35 interest. In particular, physiological or developmental variants, 
e.g.. multiple alternatively processed forms of the antigen might 
be found. Thus, differential splicing of message may lead to an 
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assortment of membrane bound forms, soluble forms, and modified 
versions of antigen. 

Structural studies of the antigens will lead to design of new 
antigens, particularly analogs exhibiting agonist or antagonist 
properties on the molecule. This can be combined with previously 
described screening methods to isolate antigens exhibiting desired 
spectra of activities. 



10 



15 



V. Antibodies 

Antibodies can be raised to the various described 
polypeptides, including species, polymorphic, or allelic variants 
and fragments thereof, both in their naturally occurring forms and 
m their recombinant forms. Additionally, antibodies can be 
raised to the proteins in either their active forms or in their 
inactive forms, including native or denatured versions. Anti- 
25 idiotypic antibodies are also contemplated. 

Antibodies, including binding fragments and single chain 
versions, against predetermined fragments of the antigens can be 
raised by immunization of animals with conjugates of the fragments 
30 20 with immunogenic proteins. Monoclonal antibodies are prepared 

from cells secreting the desired antibody. These antibodies can 
be screened for binding to normal or defective polypeptide, or 
screened for agonistic or antagonistic activity. Antibodies may 
be agonistic or antagonistic, e.g., by sterically blocking partner 
25 or substrate binding. These monoclonal antibodies will usually 

bind with at least a K D of about 1 mM, more usually at least about 
300 HM, typically at least about 100 jm, more typically at least 
about 30 mm, preferably at least about 10 pi, and more preferably 
at least about 3 um or better. More preferred embodiments may 
30 have even higher affinities, e.g., at least 300 nM, 30 nM, 3 nM, 
or perhaps even picomolar affinity. 

The term -binding composition- refers to molecules that bind 
with affinity and selectivity to, e.g., the DC-PGT, e.g., in an 
antibody-antigen interaction. However, other compounds, e.g., 
35 accessory proteins, may also specifically and/or selectively 

associate with the antigen to the exclusion of other molecules. 
Typically, the association will be in a natural physiologically 



SUBSTITUTE SHEET ( rule 26 ) 



55 



WO 00/01817 



PCTAJS99/12366 



56 



10 



15 



20 



10 



15 



relevant protein-protein interaction, either covalent or non- 
covalent, and may include members of a multiprotein complex 
including carrier compounds or dimerization partners The ' 
molecule may be a polymer, or chemical reagent. No implication as 
to whether an antigen is necessarily a convex shaped molecule 
e.g.. the ligand or the receptor of a ligand-receptor interaction 
is necessarily represented, other than whether the interaction 
exhibits similar specificity, e.g., specific or selective 
affinity. A functional analog, may be a polypeptide with 
structural modifications, e.g., a mutein, or may be a wholly 
unrelated molecule, e.g., whi ch has a molecular shape which 
interacts with the appropriate ligand binding determinants The 
ligands may serve as agonists or antagonists of the receptor, see 
e.g., Goodman, et al . Goodman f. G1 W«t- The Ph,^^ ^ 
Pases of There in, (current edition) Pergamon Press. Tarrytown 

The term "binding agent: antigen complex", as used herein 
refers to a complex of a binding agent and antigen, eg a rc _ PGT 
„„ Pr ° teln ' that 15 f0rmed b ^ specific binding of the binding agent 
20 to antigen. Specific or selective binding of the binding agent 
means that the binding agent has a specific binding site e g 
antigen binding site, that recognizes a site on the antigen For 
example, antibodies raised to a DC-PGT protein and recognizing an 
epitope on the protein are capable of forming a binding agent -DC- 
PGT protein complex by specific selective binding. Typically the 
formation of a binding agent :DC-PGT protein complex allows the 
qualitative or quantitative measurement of DC-PGT protein in a 
mixture of other proteins and biologies. The term "antibody -DC- 
PGT protein complex" refers to an embodiment in which the binding 
30 agent, e.g., is the antigen binding portion from an antibody The 
antibody may be monoclonal, polyclonal, or a binding fragment of 
45 3X1 mtib «^ e.g., an Fab or F(ab)2 fragment. The antibody will 

preferably be a polyclonal antibody for cross-reactivity testing 
purposes . 

The phrase "specifically binds to an antibody" or 
"specifically immunoreactive with", when referring to a protein or 
peptide, refers to a binding reaction which is determinative of 
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the presence of the protein in the presence of a heterogeneous 
population of other proteins and other biological components. 
Thus, under designated immunoassay conditions, the specified 
antibodies bind to a particular protein and do not sionif icantly 
bmd other proteins present in the sample. Specific binding to an 
antibody under such conditions may require an antibody that is 
selected for its specificity or selectivity for a particular 
protein. Often, the serum can be immunoselected or 
immunodepleted, to minimize crossreactivity with a specific target 
protein . 

A DC-PGT polypeptide that specifically binds to, or that is 
specifically immunoreactive with, an antibody, e.g., such as a 
polyclonal antibody, generated against a defined immunogen, e.g., 
such as an immunogen consisting of an amino acid sequence of SEQ 
ID NO: 2, or fragments thereof, or a polypeptide generated from 
the nucleic acid of SEQ ID NO: 1 is typically determined in an 
immunoassay. Included within the metes and bounds of the present 
invention are those nucleic acid sequences described herein, 
including functional variants, that encode polypeptides that 
selectively bind to polyclonal antibodies generated against the 
prototypical DC-PGT polypeptide as structurally and functionally 
defined herein. The immunoassay typically uses a polyclonal 
antiserum which was raised, e.g., to a protein of SEQ ID NO: 2. 
This antiserum is selected to have low crossreactivity against 
appropriate other PGT family members, preferably from the same 
species, and any such crossreactivity is removed by 
immunoabsorption prior to use in the immunoassay. Appropriate 
selective serum preparations can be isolated, and characterized. 

The purified protein or defined peptides are useful for 
generating antibodies by standard methods, as described above. 
Synthetic peptides or purified protein can be presented to an 
immune system to generate monoclonal or polyclonal antibodies. 
See. e.g., Coligan (1991) Current Pr otocols ^ Tmunoin^ 
Wiley/Greene; and Harlow and Lane (1989) Antibody • a i^r^nry 
Manual, Cold Spring Harbor Press. Alternatively, the HDTEA84 can 
be used as a specific binding reagent, and advantage can be taken 
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of its specificity of binding, much like an antibody would be 
used. 

For example, the specific binding composition could be used 
for screening of an expression library made from a cell line which 
expresses an HDTEA84, HSLJD37R, or RANKL. The screening can be 
standard staining of surface expressed antigen constructs, or by 
panning. Screening of intracellular expression can also be 
performed by various staining or immunofluorescence procedures. 
The binding compositions could.be used to affinity purify or sort 
out cells expressing the protein. 

In order to produce antisera for use in an immunoassay, the 
20 Protein, e.g., of SEQ ID NO: 2, is isolated as described herein. 

For example, recombinant protein may be produced in a mammalian 
cell line. An appropriate host, e.g., an inbred strain of mice 
15 such as Balb/c, is immunized with the protein of SEQ ID NO: 2 
25 usin 9 a standard adjuvant, such as Freund's adjuvant, and a 

standard mouse immunization protocol (see Harlow and Lane). 
Alternatively, a substantially full length synthetic peptide 
derived from the sequences disclosed herein can be used as an 
30 20 immunogen. Polyclonal sera are collected and titered against the 

immunogen protein in an immunoassay, e.g., a solid phase 
immunoassay with the immunogen immobilized on a solid support. 
Polyclonal antisera with a titer of 1 0 4 or greater are selected 
35 ^ tested for their cross reactivity against other PGT family 

25 members, e.g., human or rat PGT, using a competitive binding 

immunoassay such as the one described in Harlow and Lane, supra, 
at pages 570-573. Preferably at least two PGT family members are 
used in this determination in conjunction with the target. These 
PGT family members can be produced as recombinant proteins and 
isolated using standard molecular biology and protein chemistry 
techniques as described herein. Thus, antibody preparations can 
45 156 identif ied or produced having desired selectivity or 

specificity for subsets of PGT family members . 

Immunoassays in the competitive binding format can be used 
for the crossreactivity determinations. For example, the protein 
of SEQ ID NO: 2 can be immobilized to a solid support. Proteins 
added to the assay compete with the binding of the antisera to the 
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immobilized antigen. The ability of the above proteins to compete 
wath the binding of the antisera to the immobilized protein is 
compared to the protein of SEQ ID NO: 2. The percent 
crossreactivity for the above proteins is calculated, using 
standard calculations . Those antisera with less than 10% 
crossreactivity with each of the proteins listed above are 
selected and pooled. The cross-reacting antibodies are then 
removed from the pooled antisera by immunoabsorption or 
immunodepletion with the above-listed proteins. 

The immunoabsorbed and pooled antisera are then used in a 
competitive binding immunoassay as described above to compare a 
20 second P ro tein to the immunogen protein, m order to make this 

comparison, the two proteins are each assayed at a wide range of 
concentrations and the amount of each protein required to inhibit 
15 50% of the binding of the antisera to the immobilized protein is 
25 determined. If the amount of the second protein required is less 

than twice the amount of the protein of, e.g., SEQ ID NO: 2 that 
is required, then the second protein is said to specifically bind 
to an antibody generated to the immunogen. 

The antibodies of this invention can also be useful in 
diagnostic applications. As capture or non-neutralizing 
antibodies, they can be screened for ability to bind to the 
antigens without inhibiting binding by a partner. As neutralizing 
antibodies, they can be useful in competitive binding assays. 
They will also be useful in detecting or quantifying a described 
protein or its binding partners. See, e.g., Chan (ed. 1987) 
Immunology: A Practical Academic Press, Orlando, FL; Price 

and Newman (eds. 1991) P rinciples and P^cri^ o f i M „ nnagcaY 
Stockton Press, N.Y.; and Ngo (ed. 1988) Nonisotonir iMimos.^.v 
30 Plenum Press, N.Y. Cross absorptions or depletions and other 
tests will identify antibodies which exhibit various spectra of 
specificities, e.g., unique or shared species specificities. 

Further, the antibodies, including antigen binding fragments, 
of this invention can be potent antagonists that bind to the 
antigen and inhibit functional binding or inhibit the ability of a 
binding partner to elicit a biological response. They also can be 
useful as non-neutralizing antibodies and can be coupled to toxins 
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or radionuclides so that when the anybody binds to antigen a 
cell expressing it, e.g., on its surface, is killed. Further 
these antibodies can be conjugated to drugs or other therapeutic 
agents, either directly or indirectly by means of a linker, and 
may effect drug targeting. They may be labeled for histology 
evaluation . 

Antigen fragments may be joined to other materials 
particularly polypeptides, as fused or covalently joined 
polypeptides to be used as immunogens. An antigen and its 
fragments may be fused or covalently linked to a variety of 
immunogens, such as keyhole limpet hemocyanin, bovine serum 
albumin, tetanus toxoid, etc. See Microbio W H oeber Medical 
Division, Harper and Row, l 969; Lands teiner (1962) Specif^ nf 
Seroloqica! R^rHon. , Dover Publications, New York; Williams, et 
al> <1967) Methods in TmnunnW r ,, nnptlMn ^^, vq1 lf 

Academic Press, New York,- and Harlow and Lane (1988) Antibody ■ 
A Laboratory Man uaT, CSH Press, NY, for descriptions of methods of 
preparing polyclonal antisera. 

in some instances, it is desirable to prepare monoclonal 
0 antabodaes from various mammalian hosts, such as mice, rodents 
primates, humans, etc. Description of techniques for preparing 
such monoclonal antibodies may be found in, e.g., Stites, et al 
(eds - } Basic and n i.nical Tmmun ol or y (4th ed. ) , Lange Medical 
Publications, Los Altos, CA, and references cited therein; Harlow 
5 and Lane (1988) MtiMi^- A Lahnr,^, CSH PresS; 

Coding (1986) Monoclone AnHhodi^- ^^ 1n1pH an ^ D _.;„„ (2d 
ed.). Academic Press, New York; and particularly in Kohler and 
Mxlstein (1975) in Nature 256:495-497, which discusses one method 
of generating monoclonal antibodies. 
) Other suitable techniques involve in vitro exposure of 

lymphocytes to the antigenic polypeptides or alternatively to 
selection of libraries of antibodies in phage or similar vectors 
See, Huse, et al. (1989) -Generation of a Large Combinatorial 
Library of the Immunoglobulin Repertoire in Phage Lambda, » Science 
246:1275-1281; and Ward, et al. (1989) Nature 341:544-546. The 
polypeptides and antibodies of the present invention may be used 

SUBSTITUTE SHEET ( rule 26 ) 



10 



WO00/0,8,7 PCT/US99/12366 

61 

with or without modification, including chimeric or humanized 
antibodies. Frequently, the polypeptides and antibodies will be 
labeled by joining, either covalently or non-covalently a 
substance which provides for a detectable signal. A wide variety 
of labels and conjugation techniques are known and are reported 
extensively in both the scientific and patent literature. 
Suitable labels include radionuclides, enzymes, substrates, 
cofactors, inhibitors, fluorescent moieties, chemi luminescent 
moieties, magnetic particles, and the like. Patents, teaching the 
use of such labels include U.S. Patent Nos . 3,817,837; 3 850 752- 
3.939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Also' 
recombinant immunoglobulins may be produced, see Cabilly u S 
Patent No. 4,815,567; Moore, et al . , U.S. Patent No. 4,642 334- 
and Queen, et al . (1989) Proc. Nat 1 1 ^ cad . rrea 86:10029- ' 

10033. 

The antibodies of this invention can also be used for 
affinity chromatography in isolating the protein. Columns can be 
prepared where the antibodies are linked to a solid support, e.g., 
particles, such as agarose, Sephadex, or the like, where a cell 
lysate may be passed through the column, the column washed, 
followed by increasing concentrations of a mild denaturant,' 
whereby the purified protein will be released. See, e.g., Wilchek 
et al. Q984) Meth. En^vmnl 104:3-55. 

Antibodies raised against each protein will also be useful to 
raise anti-idiotypic antibodies. These will be useful in 
detecting or diagnosing various immunological conditions related 
to expression of the respective antigens. 

VT. Nucleic Acids 

The described peptide sequences and the related reagents are 
useful in detecting, isolating, or identifying a DNA clone 
encoding, e.g., the DC-PGT, HDTEA84, HSLJD37R, RANKL, HCC5, Dubll, 
Dubl2, MD-1, MD-2, or cyclin E2 polypeptides, e.g., from a 'natural 
source. Typically, the nucleic acids, particularly natural genes, 
will be useful in isolating a gene from mammal, and similar 
procedures will be applied to isolate genes from other species, 
e.g., warm blooded animals, such as birds and mammals. They will 
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be useful for isolating genes from domestic pets, e.g., dogs and 
cats, and livestock, e.g., horse, pigs, cat tle, sheep, chickens 
turkeys, fish, etc. Cross hybridization will allow isolation of 
respective counterpart genes from other species. A number of 
5 different approaches should be available to successfully isolate a 
suitable nucleic acid clone. 

The peptide sequences allow preparation of peptides to 
generate antibodies to recognize such segments, and various 
dlff erent methods may be used to prepare such peptides. As used 
herein, e.g., the term prostaglandin transporter shall encompass 
when used in a protein context, a protein having an amino acid 
sequence shown in Table 1, or a significant fragment of such a 
protein. It also refers to a vertebrate, e.g., mammal, including 
human, derived polypeptide which exhibits similar biological 
15 function, e.g., antigenic, or interacts with prostaglandin 
transporter specific binding components, e.g., specific 
antibodies. These binding components, e.g., antibodies, typically 
bind to a prostaglandin transporter with high affinity eg at 
least about 100 nM, usually better than about 30 nM, preferably 
0 better than about 10 nM, and more preferably at better than about 
3 nM. Still higher affinities are possible, e.g., 100 pM, 30 P M, 
100 fM, etc. 

This invention contemplates use of isolated DNA or fragments 
of the present invention to encode a structurally related e g 
5 antigenically related, or biologically active protein, e.g 

substrate binding or transporting, prostaglandin transporter TNF 
receptor-like proteins, chemokine, Dubs, surface receptors, or 
cell cycle regulatory proteins, or polypeptide fragments thereof 
In addition, this invention covers isolated or recombinant DNA 
0 which encodes a structurally related or biologically active 

protein or polypeptide and that is capable of hybridizing under 
appropriate conditions with the DNA sequences described herein. 
Said biologically active protein or polypeptide can be an intact 
antigen, or fragment, and have an amino acid sequence as disclosed 
in Tables 1-13. Further, this invention covers the use of 
isolated or recombinant DNA, or fragments thereof, which encode 
proteins which are homologous to the respective genes or which 
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were isolated using cDNA encoding the proteins as a probe. 
Preferably such homologous genes or proteins will be natural forms 
isolated from other vertebrates, e.g., warm blooded animals, 
including mammals, such as primates. The isolated DNA can have 
the respective regulatory sequences in the 5' and 3 • flanks, e.g 
promoters, enhancers, poly-A addition signals, and others. 

An -isolated" nucleic acid is a nucleic acid, e.g., an RNA 
SNA, or a mixed polymer, which is substantially separated from ' 
other components which naturally accompany a native sequence, 
e.g., ribosomes, polymerases, and flanking genomic sequences ' from 
the ongmating species. The term embraces a nucleic acid 
sequence which has been removed from its naturally occurring 
intracellular environment, and includes recombinant or cloned DNA 
isolates and chemically synthesized analogs or analogs 
biologically synthesized by heterologous systems. A substantially 
pure molecule includes once or currently isolated forms of the 
molecule. Alternatively, a purified species may be separated from 
host components from a recombinant expression system. 
Generally, the nucleic acid will be in a vector or fragment less 
than about 50 kb, usually less than about 30 kb, typically less 
than about 10 kb, and preferably less than about 6 kb. 

An isolated nucleic acid will generally be a homogeneous 
composition of molecules, but will, in some embodiments, contair 
manor heterogeneity. This heterogeneity is typically found at the 
polymer ends or portions not critical to a desired biological 
function or activity. 

The peptide segments can also be used to predict appropriate 
oligonucleotides to screen a library. The genetic code, e.g., 
reverse translation, can be used to select appropriate 
oligonucleotides useful as probes for screening. See, eg SE Q 
ID NO: 1, 5, 7, 9, 11, 16, 18, 20, 22, 24, 31, 33, 35, 37, 41, 43, 
47, or 53. In combination with polymerase chain reaction (PCR) 
techniques, synthetic oligonucleotides will be useful in selecting 
correct clones from a library. Complementary sequences will also 
be used as probes, primers, or antisense strands. Various 
fragments should be particularly useful, e.g., coupled with 
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anchored vector or poly-A complementary PCR techniques or with 
complementary DNA of other peptides. 

This invention contemplates use of isolated DNA or fragments 
to encode a biologically active corresponding polypeptide. In 
addition, this invention covers isolated or recombinant DNA which 
encodes a biologically active protein or polypeptide which is 
capable of hybridizing under appropriate conditions with the DNA 
sequences described herein. Said biologically active protein or 
polypeptide can be an intact antigen, or fragment, and have an 
amino acid sequence disclosed in, e.g., SEQ ID NO: 2, 6 8 10 
12, 17, 19, 21, 23, 25, 32, 34, 36, 38, 42, 44, 46, 48.' 49, or' 5 4 
Further, this invention covers the use of isolated or recombinant 
DNA, or fragments thereof, which encode proteins which are 
homologous to a described protein or which was isolated using cDNA 
encodmg such protein as a probe. The isolated DNA can have the 
respective regulatory sequences in the 5 1 and 3 ■ flanks, e.g., 
promoters, enhancers, poly-A addition signals, and others. 

A "recombinant" nucleic acid is defined either by its method 
of production or its structure. In reference to its method of 
production, e.g., a product made by a process, the process is use 
of recombinant nucleic acid techniques, e.g., involving human 
intervention in the nucleotide sequence, typically selection or 
production. Alternatively, it can be a nucleic acid made by 
generating a sequence comprising fusion of two fragments which are 
not naturally contiguous to each other, but is meant to exclude 
products of nature, e.g.. naturally occurring mutants. Thus, 
e.g., products made by transforming cells with any unnaturally 
occurring vector is encompassed, as are nucleic acids comprising 
sequence derived using any synthetic oligonucleotide process. 
Such is often done to replace a codon with a redundant codon 
encoding the same or a conservative amino acid, while typically 
introducing or removing a sequence recognition site. 

Alternatively, it is performed to join together nucleic acid 
segments of desired functions to generate a single genetic entity 
comprising a desired combination of functions not found in the 
commonly available natural forms. Restriction enzyme recognition 
sites are often the target of such artificial manipulations, but 
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other site specific targets, e.g., promoters. DNA replication 

sites, regulation sequences, control sequences, or other useful 

featUrSS be inc °rP°rated by design. A similar concept is 

intended for a recombinant , e.g., fusion, polypeptide 

5 Specifically included are synthetic nucleic acids which by 

genetic code redundancy, encode polypeptides similar to ' fragments 

° f th£Se ""B"-. fusions of sequences from various different 

'5 species variants . 

A significant "fragment" in a nucleic acid context is a 
contiguous segment of at least about 17 nucleotides, generally at 
least about 22 nucleotides, ordinarily at least about 29 
nucleotides, more often at least about 35 nucleotides, typically 
at least about 41 nucleotides, usually at least about 47 
nucleotides, preferably at least about 55 nucleotides, and in 
particularly preferred embodiments will be at l east about 60 or 
25 more nucleotides . 

A DNA which codes for a DC-PGT, HDTEA84, HSL0D37R, RANKL 
HCC5, Dub, MD-1, MD-2, or cyclin E2 protein will be particularly 
useful to identify genes, mRNA, and cDNA species which code for 
related or homologous proteins, as well as DNAs which code for 
homologous proteins from different species. There are likely 
homologs in other species, including primates, rodents, birds, and 
fish. various such proteins should be homologous and are 
encompassed herein. However, even genes encoding proteins that 
have a more distant evolutionary relationship to the antigen can 
readily be isolated under appropriate conditions using these 
sequences if they are sufficiently homologous. Primate proteins 
are of particular interest. 

Recombinant clones derived from the genomic sequences, e g 
containing introns, will be useful for transgenic studies, 
including, e.g., transgenic cells and organisms, and for gene 
therapy. See, e.g., Goodnow (1992) "Transgenic Animals" in Roitt 
<GdJ B PCVC ] onP*tif l Of Tmmnnoloqy , Academic Press, San Diego, pp. 
1502-1504; Travis (1992) Science 256:1392-1394; Kuhn, et al! 
(1991) Science 254:707-710; Capecchi (1989) Sci«n~. 244:1288; 
50 Robertson (1987 ed. ) Teratocarci nomas „nrt ^H^ pic sttm M , a 
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Pract i cal Ap proach , IRL Press, Oxford; and Rosenberg (1992) 
Clinica l Oncolocry 10:180-199. 

Substantial homology in the nucleic acid sequence comparison 
context means either that the segments, or their complementary 
strands, when compared, are identical when optimally aligned with 
appropriate nucleotide insertions or deletions, in at least about 
50% of the nucleotides, generally at least about 58%, ordinarily 
at least about 65%, often at least about 71%, typically at least 
about 77%, usually at least about 85%, preferably at least about 
95 to 98% or more, and in particular embodiments, as high as about 
99% or more of the nucleotides. Alternatively, substantial 
homology exists when the segments will hybridize under selective 
hybridization conditions, to a strand, or its complement, 
typically using a sequence of DC-PGT, e.g., in SEQ ID NO- 1 
Topically, selective hybridization will occur when there is at 
least about 55% homology over a stretch of at least about 30 
nucleotides, preferably at least about 75% over a stretch of about 
25 nucleotides, and most preferably at least about 90% over about 
20 nucleotides. See, Kanehisa (1984) Wuc. AHH. pp. 12:203-213. 
The length of homology comparison, as described, may be over 
longer stretches, and in certain embodiments will be over a 
stretch of at least about 17 nucleotides, usually at least about 
28 nucleotides, typically at least about 40 nucleotides, and 
preferably at least about 75 to 100 or more nucleotides. 

Stringent conditions, in referring to homology in the 
.hybridization context, will be stringent combined conditions of 
salt, temperature, organic solvents, and other parameters, 
typically those controlled in hybridization reactions. Stringent 
temperature conditions will usually include temperatures in excess 
of about 30° C, usually in excess of about 37° c, typically in 
excess of about 55° C, preferably in excess of about 70° C. 
Stringent salt conditions will ordinarily be less than about 1000 
mM, usually less than about 400 mM, typically less than about 250 
mM, preferably less than about 150 mM. However, the combination 
of parameters is much more important than the measure of any 
single parameter. See, e.g., Wetmur and Davidson (1968) J. Mol . 
BifiLu 31:349-370. Hybridization under stringent conditions should 
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give a background of at least 2-fold over background, preferably 
at least 3-5 or more. 

DC-PGT. HDTEA84 , HSU7D37R, RANKL, HCC5 , Dubll, Dubl2 , MD-1 
MD-2, or cyclin E2 from other mammalian species can be cloned and 
isolated by cross-species hybridization of closely related 
species. Homology may be relatively low between distantly related 
species, and thus hybridization of relatively closely related 
species is advisable. Alternatively, preparation of an antibody 
preparation which exhibits less S p ecies specificity may be useful 
m expression cloning approaches. 
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20 VI1 - Making Proteins; Mimetics 

Nucleic acids which encodes the described proteins, or 
fragments thereof, can be obtained by chemical synthesis 
screening cDNA libraries, or screening genomic libraries prepared 
from a wide variety of cell lines or tissue samples. See e g 
Okayama and Berg (1982) Mol, M1 . B jol 2:161-170; Gubler and ' 
Hoffman (1983) Gene 25:263-269; and Glover (ed. 1984) 
A Practical ftp proarh, IRL Press, Oxford. Alternatively, the 
sequences provided herein provide useful PGR primers or allow 
synthetic or other preparation of suitable genes encoding a 
receptor; including, naturally occurring embodiments. 

DNA can be expressed in a wide variety of host cells for the 
synthesis of a full-length protein, or fragments, which can in 
25 turn, e.g., be used to generate polyclonal or monoclonal 

antibodies; for binding studies; for construction and expression 
of modified molecules; for structure/function studies; and for 
controls in detection assays. Each antigen or its fragments can 
be expressed in host cells that are transformed or transfected 
30 with appropriate expression vectors. These molecules can be 
substantially purified to be free of protein or cellular 
45 contaminants, other than those derived from the recombinant host 

and therefore are particularly useful in pharmaceutical 
compositions when combined with a pharmaceutical^ acceptable 
35 carrier and/or diluent. The antigen, or portions thereof, may be 
expressed as fusions with other proteins. 
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Vectors, as used herein, comprise plasmids, viruses, 
bacteriophage, integratable DNA fragments, and other vehicles 
which enable the integration of DNA fragments into the genome of 
the host. see, e.g., Pouwels, et al. (1985 and Supplements) 
Cloning Vectors: A Taborat-nry M M „ a i Elsevier, N.Y.; and 
Rodriguez, et al . (1988 eds.) Vector.,, A fi„ry ev of ^^.u. 
Cloning Vectors and Their n , S Pg, Buttersworth, Boston, MA. 
Expression vectors are typically self-replicating DNA or RNA 
constructs containing the desired antigen gene or its fragments, 
usually operably linked to suitable genetic control elements that 
are recognized in a suitable host cell. These control elements 
are capable of effecting expression within a suitable host. The 
specific type of control elements necessary to effect expression 
will depend upon the eventual host cell used. Generally the 
genetic control elements can include a prokaryotic promoter system 
or a eukaryotic promoter expression control system, and typically 
include a transcriptional promoter, an optional operator to 
control the onset of transcription, transcription enhancers to 
elevate the level of mRNA expression, a sequence that encodes a 
suitable ribosome binding site, and sequences that terminate 
transcription and translation. Expression vectors also usually 
contain an origin of replication that allows the vector to 
replicate independently of the host cell. 

For purposes of this invention, DNA sequences are operably 
linked when they are functionally related to each other. For 
example, DNA for a presequence or secretory leader is operably 
linked to a polypeptide if it is expressed as a preprotein or 
participates in directing the polypeptide to the cell membrane or 
in secretion of the polypeptide. A promoter is operably linked to 
a coding sequence if it controls the transcription of the 
polypeptide,- a ribosome binding site is operably linked to a 
coding sequence if it is positioned to permit translation. 
Usually, operably linked means contiguous and in reading frame, 
however, certain genetic elements such as repressor genes are not 
contiguously linked but still bind to operator sequences that in 
turn control expression. See e.g., Rodriguez, et al., Chapter 10, 
pp. 205-23 6; salbas and Bolivar (1990) Methods <n Bn Tr »i 185:14- 
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37,- and Ausubel. et al . (1993) Current , p M - 1r _ £nJ ^ L 

Biology, Greene and Wiley, ny. 

Representative examples of suitable exoression vectors 
include pCDBRl; PCD, see Okayama. et al. (1985) Mol. Cen 
5:1136-1142; pMClneo Poly-A. see Thomas, et al . (1987) Call 
51:503-512; and a baculovirus vector such as P AC 373 or pAC 610 
See. e.g.. Miller (1988) Ann. R.v. m^^k^i 42:177-195 
Usually, expression vectors are designed for stable replication in 
thexr host cells or for amplification to greatly increase the 
total number of copies of the desirable gene per cell. It is not 
always necessary to require that an expression vector replicate in 
a host cell, e.g., it is possible to effect transient expression 
of the antigen or its fragments in various hosts using vectors 
that do not contain a replication origin that is recognized by the 
15 nost cell. It is also possible to use vectors that cause 

integration of a gene or its fragments into the host DNA by 
recombination, or to integrate a promoter which controls 
expression of an endogenous gene. 

Adenovirus techniques are available for expression of the 
genes m various cells and organs. See, e.g., Hitt, et al (1997) 
Adv. Pharmacol 40:137-195; and literature from Quantum 
Biotechnologies, Montreal. Canada. Animals may be useful to 
determine the effects of the gene on various developmental or 
physiologically functional animal systems. 

Suitable host cells include prokaryotes, lower eukaryotes 
and higher eukaryotes. Prokaryotes include both gram negative and 
gram positive organisms, e.g., E . coli ^ B . subtilis . Lower 
eukaryotes include yeasts, e.g., S . cerevisiae and Pichia, and 
species of the genus Dictyostelium. Higher eukaryotes include 
established tissue culture cell lines from animal cells, both of 
non-mammalian origin, e.g., insect cells, and birds, and of 
mammalian origin, e.g., human, primates, and rodents. 

Prokaryotic host-vector systems include a wide variety of 
vectors for many different species. As used herein, E. coli and 
its vectors will be used generically to include equivalent vectors 
used xn other prokaryotes. A representative vector for amplifying 
DNA is pBR322 or many of its derivatives. Vectors that can be 
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used to express the prostaglandin transporter or its fragments 
include, but are not limited to, such vectors as those containing 
the lac promoter (pUC-series) , trp promoter ( P BR322-trp) ; Ipp 
promoter (the plN-series) ; lambda-pP or P R promoters (pOTS) • or 
5 hybrid promoters such as ptac (pDR540) . See Brosius et al (i 98 8> 
"Expression Vectors Employing Lambda-, trp-, lac-, and Ipp-derived 
Promoters", in Rodriguez and Denhardt (eds., Vector.- a .„^ y „ f 
Molecular Cloning Vectors ^d TW n^g, Buttersworth, Boston 
Chapter 10, pp. 205-236, which is incorporated herein by 
10 reference. 

Lower eukaryotes, e.g., yeasts and Dictyostelium, may be 
transformed with vectors encoding vertebrate prostaglandin 
transporters. For purposes of this invention, the most common 
lower eukaryotic host is the baker's yeast, Saccharomyces 
cerevisiae. it will be used to generally represent lower 
eukaryotes although a number of other strains and species are also 
available. Yeast vectors typically consist of a replication 
origin (unless of the integrating type) , a selection gene a 
promoter, DNA encoding the desired protein or its fragments, and 
sequences for translation termination, polyadenylation, and 
transcription termination. Suitable expression vectors for veast 
include such constitutive promoters as 3-phosphoglycerace kinase 
and various other glycolytic enzyme gene promoters or such 
inducible promoters as the alcohol dehydrogenase 2 promoter or 
!S metallothionine promoter. Suitable vectors include derivatives of 
the following types: self -replicating low copy number (such as the 
YRp-series) , self-replicating high copy number (such as the YEp- 
series); integrating types (such as the Yip-series) , or mini- 
chromosomes (such as the YCp-series) . 
0 Higher eukaryotic tissue culture cells are the preferred host 

cells for expression of the functionally active prostaglandin 
transporter. In principle, most higher eukaryotic tissue culture 
cell lines are workable, e.g., insect baculovirus expression 
systems, whether from an invertebrate or vertebrate source. 
5 However, mammalian cells are preferred, in that the processing, 
both cotranslationally and posttranslationally is more likely to 
simulate natural forms. Transformation or transfection and 
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propagation of such cells has become a routine procedure 
Examples of useful cell lines include HeLa cells, Chinese hamster 
ovary (CHO) cell lines, baby rat kidney (BRK) cell lines, insect 
cell lines, bird cell lines, and monkey (COS) cell lines 
Expression vectors for such cell lines usually include an origin 
of replication, a promoter, a translation initiation site. RNA 
splice sites (if genomic DNA is used) , a polyadenylation site, and 
a transcription termination site. These vectors also usually 
contain a selection gene or amplification gene. Suitable 
expression vectors n*y be plasmids, viruses, or retroviruses 
carrying promoters derived, e.g., f rom such sources as frQm 
adenovirus, SV40, parvoviruses, vaccinia virus, or 
cytomegalovirus. Representative examples of suitable expression 
vectors include pCDNAl; pCD, see Okayama et al . (1985) Mol. Cp~\ 1 
Biol^ 5:113 6-1142,- pMClneo Poly-A, see Thomas et al . (1987) Cell 
51:503-512; and a baculovirus vector such as pAC 373 or pAC 610. 

It will often be desired to express a DC-PGT, HDTEA84 
HSLJD37R, RANKL, HCC5, Dubll, Dubl2, MD-1, MD-2, or cyclin ' E2 
polypeptide in a system which provides a specific or defined 
glycosylate pattern. See, e.g., Luckow and Summers (1988) 
Bio/Techno) oqy 6:47-55; and Kaufman (1990) Meth. Bnzvmm 185:487- 
511. Preferred prokaryotic forms lack eukaryotic glycosylation 
patterns. However, the pattern will be modifiable by exposing the 
35 Polypeptide, e.g., an unglycosylated form, to appropriate 

25 glycosylating proteins introduced into a heterologous expression 
system. For example, the desired gene may be cotransf ormed with 
one or more genes encoding mammalian or other glycosylating 
enzymes. Using this approach, certain mammalian glycosylation 
patterns will be achievable or approximated in prokaryote or other 
30 cells. 

The DC-PGT, HDTEA84, HSLJD37R, RANKL, HCC5 , Dubll, Dubl2, MD- 
1, MD-2, or cyclin E2, or a fragment thereof, may be engineered to 
be phosphatidyl inositol (PI) linked to a cell membrane, but can 
be removed from membranes by treatment with a phosphatidyl 
inositol cleaving enzyme, e.g., phosphatidyl inositol 
phospholipase-C. This releases the antigen in a biologically 
active form, and allows purification by standard procedures of 
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protein chemistry. See, e.g., Low (1989) Biochim n^ rhv o ^ 
988:427-454; Tse, et al . (1985) Science 230:1003-1008; and 
Brunner, et al. (1991) j. C. P .M n n i 114:1275-1283. 

Transformed cells include cells, preferably mammalian, that 
have been transformed or transfected with vectors containing a 
prostaglandin transporter gene, typically constructed using 
recombinant DMA techniques. Transformed host cells usually 
express the antigen or its fragments, but for purposes of cloning 
amplifying, and manipulating its dna, do not need to express the 
protein. This invention further contemplates culturing 
transformed cells in a nutrient medium, thus permitting the 
protein, or soluble fragments, co accumulate in the culture 
Soluble protein can be recovered, either from the culture or from 
the culture medium, and membrane associated proteins may be 
15 prepared from suitable cell subf ractions . 

Now that the genes have been characterized, fragments or 
derivatives thereof can be prepared by conventional processes for 
synthesizing peptides. These include processes such as are 
described in Stewart and Young (1984) Solid Ph a *» p-^- 
0 Synthes i s , Pierce Chemical Co., Rockford, IL; Bodanszky and 
Bodanszky (1984) The PracMV P of P.nt.^ <y^^ - Springer . 
Verlag, New York; and Bodanszky (1984) The Prin^m^ of Pent: id- 
Syath£si£, springer -verlag, New York. For example, an azide 
process, an acid chloride process, an acid anhydride process a 
5 mixed anhydride process, an active ester process (for example p- 
mtrophenyl ester, N-hydroxysuccinimide ester, or cyanomethyl 
ester), a carbodiimidazole process, an oxidative-reductive 
process, or a dicyclohexylcarbodiimide (DCCD) /additive process can 
be used. Solid phase and solution phase syntheses are both 
) applicable to the foregoing processes. 

The proteins, fragments, or derivatives are suitably prepared 
in accordance with the above processes as typically employed in 
peptide synthesis, generally either by a so-called stepwise 
process which comprises condensing an amino acid to the terminal 
> amino acid, one by one in sequence, or by coupling peptide 

fragments to the terminal amino acid. Amino groups that are not 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/1ZJM 



73 



10 



15 



20 



10 



15 



being used in the coupling reaction are typically protected to 
prevent coupling at an incorrect location. 

If a solid phase synthesis is adopted, the C-terminal amino 
acid is bound to an insoluble carrier or support through its 
carboxyl group. The insoluble carrier is not particularly limited 
as long as it has a binding capability to a reactive carboxyl 
group. Examples of such insoluble carriers include halomethyl 
resins, such as chloromethyl resin or bromomethyl resin, 
hydroxymethyl resins, phenol resins, tert-alkyloxycarbonyl- 
hydrazidated resins, and the like. 

An amino group-protected amino acid is bound in sequence 
through condensation of its activated carboxyl group and the 
reactive amino group of the previously formed peptide or chain to 
synthesize the peptide step by step. After synthesizing the 
complete sequence, the peptide is split off from the insoluble 
25 carrier to produce the peptide. This solid -phase approach is 

generally described by Merrifield et al . (1963) in J. Am. Chgm . 
So^ 85:2149-2156, which is incorporated herein by reference. 

The prepared protein and fragments thereof can be isolated 
and purified from the reaction mixture by means of peptide 
separation, e.g., by extraction, precipitation, electrophoresis 
and various forms of chromatography, and the like. The proteins 
of this invention can be obtained in varying degrees of purity 
35 depending upon its desired use. Purification can be accomplished 

by use of the protein purification techniques disclosed herein or 
by the use of the antibodies herein described in immunoabsorbent 
affinity chromatography. This immunoabsorbent affinity 
chromatography is carried out by first linking the antibodies to a 
solid support and then contacting the linked antibodies with 
solubilized lysates of appropriate source cells, lysates of other 
cells expressing the protein, or lysates or supernatants of cells 
producing the desired protein as a result of DNA techniques, see 
below. Detergents may be necessary to include in the methods to 
maintain protein solubility. 
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The present invention provides reagents which will find use 
in diagnostic applications as described elsewhere herein eg in 
the general description for cell mediated conditions, or below'in 
the description of kits for diagnosis. The genes will be useful 
xn forensic analyses, e.g., to identify species, or to diagnose 
different cell subsets or types. 

If DC-PGT is used to clear prostaglandins (PCs) and other 
metabolically active organic anions from the body (in the liver 
fetal liver, lung and placenta) it is easy to suppose that an 
alteration in the capacity of this mechanism could augment the 
allergic response. Prostaglandin PGF 2a and PGD 2 , and PGG 2 and 
thromboxane A 2 can cause airway obstruction, particularly in the 
peripheral lung, while PGE 2 and PGI 2 are bronchodilators . Use of 
the transporter of the invention could help transport or remove 
15 these prostaglandins to modulate airway obstruction. 

Additionally, prostaglandins play an important role in 
secondary immunosuppression seen following surgical stress 
Alexander (1990) Z^^a 30:S70; Faist, et al. (1987) J. Trannu, 
27:837; Ninneman, et al. (1984) J. Trauma 24:201; Wood, et al . 
0 (1987) Arch. S„rq . 122:179; Polk, et al. in Eremin and Sewell 
(6dS - 1992) ^ PnmunmocTical Rfl a j g ft f p^g^T <y^ Q ^ 
praCtice ' <*£or<l u - Press. m particular, PGE2 inhibits 
lymphocyte proliferation, decreases IL- 2 release, decreases 
response to il-2, inhibits natural killer cells, and activates 
5 supressor cells. Major injury has been shown to result in 

increased production of PGE2 from inhibitory macrophages, with a 
resulting decrease in production of IL-1 and IL-2. This effect 
may persist for 7 to 10 days after major injury. Other studies 
have shown no increase in circulating PGE2 following burns but did 
0 fmd increased local production with increased sensitivity of 
lymphocytes to the action of PGE2 . 

Prostaglandin E2, through locally produced vasodilator/ 
effects, may play a role in rheumatoid arthritis by promoting the 
entry of inflammatory cells into the joint. Once in the synovial 
5 fluid, polymorphonuclear leukocytes can ingest immune complexes, 
which, in turn, cause neutrophils to produce reactive oxygen 
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metabolites and other inflammatory mediators to further enchance 
an inflammatory cascade. Henson. et al . (1987) J. Clin m^- 
79:693. 

Accordingly, it is possible to use the present invention to 
5 modulate prostaglandins in a subject suffering from trauma, 
injury, disease or in post-surgical treatments. 

Immune system cells may be synthesizing PGs and thus using 
DC-PGT in an efflux role for removing PGs from the intracellular 
space may be useful. Equally, DC-PGT might transport a specific 
10 organic anion. Abnormal proliferation, regeneration, 

degeneration, and atrophy may be modulated by appropriate 
therapeutic treatment using the compositions provided herein For 
example, a disease or disorder associated with abnormal function 
of a prostaglandin transporter should be a likely target for a 
substrate or blocking substrate. Alternatively, the transporter 
may be a useful means for supplying important metabolites or 
metabolite blockers to the respective cells. 

For example, transformation with the transporter may increase 
availability of the substrate to the cell, m certain situations 
a prostaglandin analog might be advantageously supplied to the 
cell. The prostaglandin analog might confer high susceptibility 
to further treatment, e.g., radiation sensitivity or otherwise, or 
may directly affect normal metabolism, e.g., nucleic acid related 
enzymes. Alternatively, the transporter may be useful to screen 
for antagonists or inhibitors, which might be effective in 
blocking the normal availability to the cell of the natural 
substrate. Screening methods for such prostaglandin analogs are 
provided . 

Screening using prostaglandin transporter for binding 
metabolites or compounds having binding affinity to the 
transporter can be performed, including isolation of associated 
components. Subsequent biological assays can then be utilized to 
determine if the compound has intrinsic biological activity and is 
therefore an agonist or antagonist in that it blocks an activity 
of the transporter. In particular, prostaglandin analogs may be 
useful in blocking binding of the natural target or otherwise 
blocking transporter activity. Alternatively, various other 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



76 



10 



15 



10 

20 



analogs may be useful in blocking an ion transporter, or organic 
anion source. This invention further contemplates the therapeutic 
use of antibodies to prostaglandin transporter as antagonists. 
This approach should be particularly useful with other 
prostaglandin transporter species variants and other members of 
the family. 

Antagonists of the transporter activity, e.g., antibodies 
which block the transport, may be useful in various medical 
conditions. These would include immune, inflammatory or allergic 
abnormalities, all of which are important where transfer of 
organic anions take place. Certain congenital diseases of 
prostaglandin physiology will be susceptible to such a therapeutic 
approach . 

The HDTEA84, HSLJD37R , RANKIi, HCC5 , MD-1, or MD-2 (naturally 
15 occurring or recombinant) , fragments thereof, and antibodies 
25 thereto, along with compounds identified as having binding 

affinity to them, should be useful in the treatment of conditions 
associated with abnormal physiology or development, including 
abnormal proliferation, e.g., cancerous conditions, or 
20 degenerative conditions, m particular, modulation of development 
of lymphoid cells will be achieved by appropriate therapeutic 
treatment using the compositions provided herein. For example, a 
disease or disorder associated with abnormal expression or 
abnormal signaling by a ligand or receptor should be a likely 
target for an agonist or antagonist of the antigen. The antigen 
plays a role in regulation or development of hematopoietic cells, 
e.g., lymphoid cells, which affect immunological responses, e.g.,' 
autoimmune disorders . 

In particular, the antigen may provide a costimulatory signal 
to cell activation, or be involved in regulation of cell 
proliferation or differentiation. Thus, the HDTEA84 , HSLJD37R, 
45 RANKL ' HCC5 ' MD " 1 ' or M 0 - 2 wi H likely modulate cells which 

possess a receptor therefor, e.g., T cell mediated interactions 
with other cell types. These interactions would lead, in 
particular contexts, to modulation of cell growth, cytokine 
synthesis by those or other cells, or development of particular 
effector cells. 
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Moreover, the HDTEA84, HSLJD37R, RANKL, HCC5, MD-1 or MD-2 
or antagonists could redirect T cell responses, e.g., between Thl 
and Th2 polarization, or with ThO cells, or may affect B cells or 
other lymphoid cell subsets. Among these agonists should be 
various antibodies which recognize the appropriate epitopes, e g 
which mimic binding of i igand or reC e Pt or to its partner 
Alternatively, they may bind to epitopes which sterically can 
block receptor binding. Bone morphogenesis may be regulated by 
these receptor segments. 

The ligands or receptors may provide a selective and powerful 
way to modulate immune responses in abnormal situations e g 
autoimmune disorders, including rheumatoid arthritis, systemic 
lupus erythemacosis (SLE) , Hashimoto's autoimmune thyroiditis as 
well as acute and chronic inflammatory responses in which T cell 
activation, expansion, and/or immunological T cell memory play an 
important role. See also Samter, et al. (eds, Inmur^logJ^l 
Creases vols. 1 and 2, Little, Brown and Co. Regulation of bone 
morphogenesis, T cell activation, expansion, and/or cytokine 
release by the naturally occurring secreted form of HDTEA84 
HSLJD37R, rankl, HCC5, MD-1, or MD-2, or an antagonist thereof 
may be effected. 

In addition, certain combination compositions with other 
modulators of signaling would be useful, especially with the TNF 
receptor-like genes. Such other signaling molecules might 
25 mclude, e.g., TCR reagents, CD40, CD40L, CTLA-8, CD28, SLAM fas 
osteoprotegerin, and their respective antagonists, including 
antibodies . 

Cyclin E2 nucleotides, e.g., human E2 DNA Qr ^ ^ 

be used as a component in a forensic assay. For instance, the 
nucleotide sequences provided may be labeled using, e.g., 32 P or 
biotin and used to probe standard restriction fragment 
polymorphism blots, providing a measurable character to aid in 
distinguishing between individuals. Such probes may be used in 
well-known forensic techniques such as genetic fingerprinting. m 
addition, nucleotide probes made from cyclin E2 sequences may be 
used in in situ assays to detect chromosomal abnormalities. For 
instance, rearrangements in the human chromosome encoding a cyclin 
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E2 gene may be detected via well-known in situ techniques, using 
cyclm E2 probes in conjunction with other known chromosome 
markers. The cyclin E2 gene may have useful prognostic utility in 
various cancers, e.g., breast, etc. 

Antibodies and other binding agents directed towards cyclin 
E2 proteins or nucleic acids may be used to purify the 
corresponding cyclin E2 molecule. ^ described in the Examples 
below, antibody purification of cyclin E2 protein components is 
both possible and practicable. Antibodies and other binding 
agents may also be used in a diagnostic fashion to determine 
whether cyclin E2 protein components are present in a tissue 
sample or cell population using well-known techniques described 
herein. The ability to attach a binding agent to a cyclin E2 
protein provides a means to diagnose disorders associated with 
15 cyclin E2 protein misregulation. Antibodies and other cyclin E2 
protein binding agents may also be useful as histological markers 
As described in the examples below, cyclin E2 protein expression ' 
x- limited to specific tissue types. By directing a probe, such 
as an antibody or nucleic acid to a cyclin E2 protein it is 
20 possible to use the probe to distinguish tissue and cell types in 
situ or in vitro. 

This invention also provides reagents with significant 
therapeutic value. The cyclin E2 protein (naturally occurring or 
recombinant), fragments thereof, and antibodies thereto, along 
with compounds identified as having binding affinity to a cyclin 
E2 protein, can be useful in the treatment of conditions 
associated with abnormal physiology or development, including 
abnormal proliferation, e.g., cancerous conditions, or 
degenerative conditions. Abnormal proliferation, regeneration, 
degeneration, and atrophy may be modulated by appropriate 
therapeutic treatment using the compositions provided herein. For 
example, a disease or disorder associated with abnormal expression 
or abnormal signaling by a cyclin E2 protein is a target for an 
agonist or antagonist of the protein. The proteins likely play a 
role in regulation or development of neuronal or hematopoietic 
cells, e.g., lymphoid cells, which affect immunological responses 
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Various abnormal conditions are known in each of the cell 
types shown to possess, e.g., HDTEA84, mRNA by Northern blot 
analysis. See Berkow (ed.) The M er cR Manual of ™» m ~<~ ^ 
IheraEY, Merck & Co., Rahway, NJ; Thorn, et al. Harri.^n^ 
5 Principles of in^mfl? Merino , McGraw-Hill, NY; and Weatherall, 
et al. (eds.) Oxford Tpyfhook nf VW.^ ir - oxford University 
Press, Oxford. Many other medical conditions and diseases involve 
T cells or are T cell mediated, and many of these may be 
responsive to treatment by an agonist or antagonist provided 
10 herein. See, e.g., stites and Terr (eds; 1991, Basic m « m ^<„ r 
Immunology Appleton and Lange, Norwalk, CT; and Samter, et al. 
(«3s) iMolgor^ al ni^po Little, Brown and Co. These problems 
should be susceptible to prevention or treatment using 
compositions provided herein. 

Specific, or selective, antibodies can be purified and then 
administered to a patient, veterinary or human. These reagents 
can be combined for therapeutic use with additional active or 
inert ingredients, e.g., in conventional pharmaceutical^ 
acceptable carriers or diluents, e.g., immunogenic adjuvants 
along with physiologically innocuous stabilizers, excipients or 
preservatives. These combinations can be sterile filtered and 
placed into dosage forms as by lyophilization in dosage vials or 
storage in stabilized aqueous preparations. This invention also 
contemplates use of antibodies or binding fragments thereof, 
including forms which are not complement binding. 

Drug screening using proteins or fragments thereof can be 
performed to identify compounds having binding affinity to or 
other relevant biological effects on antigen functions, including 
isolation of associated components. Subsequent biological assays 
can then be utilized to determine if the compound has intrinsic 
stimulating activity or is a blocker or antagonist in that it 
blocks the activity of the antigen, e.g.. mutein antagonists. 
Likewise, a compound having intrinsic stimulating activity can 
activate the signal pathway and is thus an agonist in that it 
overcomes any blocking activity of these soluble forms of 
receptors. This invention further contemplates the therapeutic 
use of blocking antibodies to ligands or receptors as agonists or 
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antagonists and of stimulatory molecules, e.g., muteins, as 
agonists. This approach should be particularly useful with other 
soluble receptor species variants. 

Another therapeutic approach included within the invention 
involves direct administration of reagents or compositions by any 
conventional administration techniques (e.g., but not restricted 
to local injection, inhalation, or administered systemically) 
to the subject with an immune, allergic, or trauma disorder The 
reagents, formulations, or compositions included within the bounds 
and metes of the invention may also be targeted to specific cells 
or transporters by methods described herein. The actual dosage of 
reagent, formulation, or composition that modulates an immune 
allergy, or trauma disorder depends on many factors, including 
the size and health of an organism, however one of ordinary skill 
m the art can use the following teachings describing the methods 
and techniques for determining clinical dosages. See e g 
Spilker (1984) Guide to rUnicl ^ pev.lnnin. ^J^^ 

Raven Press Books, Ltd., New York, pp. 7-13, 54-60; Spilker (1991) 
Guide to Clinical Triq lB Raven Press, Ltd., New York, pp. 93-101- 
Craig and stitzel (eds. 1985) Modern Ph.n^m^, 2 d ed. , Little' 
Brown and Co., Boston, pp. 127-33; Speight (ed. 1987) Avery-.. 

Treatment; Principle fl nH Pract irp of m ining Ph a ^~o~ T i . nni 

a Hwrapgutic , 3d ed.. Williams and Wilkins, Baltimore, pp -0-56- 
Tallarida, et al . (1988) Princ ipT e P in r+ner*, pv^^^ 
Spnnger-Verlag, New York, pp. i 8 - 20 , . Generally, the dose will 
be in the range of about between 0.5 fg/ m l and 5 00 ng/ m l, 

inclusive, final concentration administered per day to an adult in 

a pharmaceutically acceptable carrier. 

The quantities of reagents necessary for effective therapy 
Will depend upon many different factors, including means of 
administration, target site, physiological state of the patient 
and other medicants administered. Thus, treatment dosages should 
be titrated to optimize safety and efficacy. Typically, dosages 
used in vitro may provide useful guidance in the amounts useful 
for in situ administration of these reagents. Animal testing of 
effective doses for treatment of particular disorders will provide 
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further predictive indication of human dosage. Various 
considerations are described, e.g.. in Oilman, et al. (eds. 1990) 
Goodman and r T j W«,- ^ PHarmaco] ocical R„^c o f Th^o, lM ^ 
8th Ed., Pergamon Press; and Re m j n g ron ■ «, Pharmacol c„ i r „__' 
17th ed. (1990). Mack Publishing Co., Easton, Penn. Methods for 
administration are discussed, e.g., for oral , intravenous 
intraperitoneal, or intramuscular administration, transdermal 
diffusion, and others. Pharmaceutical^ acceptable carriers will 
include water, saline, buffers, and other compounds described 
e.g., xn the Mer^^nd^, Merck & Co., Rahway, New J erse y. Dosage 
ranges would ordinarily be expected to be in amounts lower than 1 
mM concentrations, typically less than about 10 um concentrations 
usually less than about 100 nM, preferably less than about 10 P M ' 
(picomolar) . and most preferably less than about 1 f M 
15 (femtomolar), with an appropriate carrier. Slow release 

formulations, or a slow release apparatus will often be utilized 
for continuous or long term administration. See, e.g., Langer 
(1990) Science 249:1527-1533. 

Ligands, receptors, enzymes, fragments thereof, and 
0 antibodies to it or its fragments, antagonists, and agonists, may 
be administered directly to the host to be treated or, depending 
on the size of the compounds, it may be desirable to conjugate 
them to carrier proteins such as ovalbumin or serum albumin prior 
to their administration. Therapeutic formulations may be 
5 administered in many conventional dosage formulations, while it 
is possible for the active ingredient to be administered alone, it 
18 Parable to present it as a pharmaceutical formulation 
Formulations typically comprise at least one active ingredient, as 
defined above, together with one or more acceptable carriers 
0 thereof. Each carrier should be both pharmaceutically and 

physiologically acceptable in the sense of being compatible with 
the other ingredients and not injurious to the patient. 
Formulations include those suitable for oral, rectal, nasal, 
topical, or parenteral (including subcutaneous, intramuscular, 
5 intravenous and intradermal) administration. The formulations may 
conveniently be presented in unit dosage form and may be prepared 
by methods well known in the art of pharmacy. see, e.g., Gilman, 
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et al. (eds. 1990) Gopdman anri Gili^n^. ^ e Pharm^^^, 
gases of Therflmiti rp, 8th Ed., Pergamon Press; and Remington -C 
Pharmaceutica l Sc iences, 17th ed. (1990), Mack Publishing Co., 
Easton. Penn.; Avis, et al. (eds. 1993) Phama^.t^,-,! n»,.^' 
5 ParRn^era l Meq-jcatiom , Dekker, New York; Lieberman, et al. 

(eds. 1990) Pharmaceutical Do sage Form s: Tables . Dekker, New 
York; and Lieberman, et al. (eds. 1990) Pharmaceutic^ 
Forms: Disperse Systems , Dekker, New York. The therapy of this 
invention may be combined with or used in association with other 
10 agents, e.g.. other modulators of cell activation, e.g., CD40 
CD40 ligand, CD28, CTLA-4, B7, B70, SLAM, T cell receptor 
signaling entities, or their respective antagonists. 

Both the naturally occurring and the recombinant forms of the 
proteins of this invention are particularly useful in kits and 
assay methods which are capable of screening compounds for binding 
activity to the proteins. Several methods of automating assays 
have been developed in recent years so as to permit screening of 
tens of thousands of compounds in a short period. See, e g 
Fodor, et al. (1991) Science 251:767-773, which describes means 
0 for testing of binding affinity by a plurality of defined polymers 
synthesized cn a solid substrate. The development of suitable 
assays can be greatly facilitated by the availability of large 
amounts of purified, soluble proteins or nucleic acids as provided 
by this invention. 
5 Other methods can be used to determine the critical residues 

in the substrate, ligand, or receptor binding interactions. 
Mutational analysis can be performed, e.g., see Somoza, et al. 
(1993) J . Exp, Mff l, 178:549-558, to determine specific residues 
critical in the interaction and/or signaling. This will allow 
0 study of both extracellular domains, involved in the soluble 
ligand interaction, or intracellular domain of a transmembrane 
form, which provides interactions important in intracellular 
signaling. 

For example, antagonists can normally be found once the 
antigen has been structurally defined, e.g., by tertiary structure 
data. Testing of potential interacting analogs is now possible 
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upon the development of highly automated assay methods using a 
purified protein, m particular, new agonists and antagonists 
will be discovered by using screening techniques described herein 
Of particular importance are compounds found to have a combined 
binding affinity for a spectrum of protein molecules, e g 
compounds which can serve as antagonists for species variants of 
the antigens . 

One method of drug screening utilizes eukaryotic or 
prokaryotic host cells which are stably transformed with 
recombinant DNA molecules expressing desired protein. Cells may 
be isolated which express a selected protein in isolation from 
other molecules. Such cells, either in viable or fixed form can 
be used for standard binding partner binding assays . See also 
Parce, et al. (1989) Science 246:243-247,- and Owicki, et al . 
{1 " 0) Fr ° C - Nar ' 1 A ^ d - ^ "SA 87:4007-4011, which describe 
sensitive methods to detect cellular responses. 

Another technique for drug screening involves an approach 
which provides high throughput screening for compounds having 
suitable binding affinity to a desired target protein, and is 
described in detail in Geysen, European Pa ten= Application 
84/03564, published on September 13, 1984. First, large numbers 
of different small peptide test compounds are synthesized on a 
solid substrate, e.g., plastic pins or some other appropriate 
surface, see Fodor, et al . (1991 , . Then the p±ns are ^ 
solubxlized, unpurified or solubilized, purified target protein 
and washed. The next step involves detecting bound protein. 

Rational drug design may also be based upon structural 
studies of the molecular shapes of the protein and other effectors 
or analogs. Effectors may be other proteins which mediate other 
functions in response to binding, or other proteins which normally 
interact. One means for determining which sites interact with 
specific other proteins is a physical structure determination, 
e.g., x-ray crystallography or 2 dimensional NMR techniques. 
These will provide guidance as to which amino acid residues form 
molecular contact regions. For a detailed description of protein 
structural determination, see, e.g., Blundell and Johnson (1976) 
Protein Crystallography , Academic Press, New York. 
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This invention also contemplates use of the proteins 
fragments thereof, peptides, and their fusion products in a 
5 vanety of diagnostic kits and methods for detecting, e g the 
presence of protein or binding partner. Typically the kit will 
have a compartment containing either a described polypeptide or 
gene segment or a reagent which recognizes one or the other e g 
fragments or antibodies. Alternatively, kits may be nucleic acid ' 
J-f cased. 

A kit for determining the binding affinity of a test compound 
to. e.g., an HDTEA84, would typically comprise a test compound; a 
labeled compound, for example a binding parser or antibody having 
known binding affinity for HDTEA84 ; a source of HDTEA84 (naturally 
15 occurring or recombinant,; and a means for separating bound from 
free labeled compound, such as a solid phase for immobilizing the 
molecule. Once compounds are screened, those having suitable 
brndxng affinity to the antigen can be evaluated in suitable 
biological assays, as are well known in the art, to determine 
0 whether they act as agonists or antagonists to the HDTEA84 
Signaling pathway. The availability of recombinant HDTEA84 
polypeptides also provide well defined standards for calibrating 
such assays . 

A preferred kit for determining the concentration of e g 
5 an HDTEA84 in a sample would typically comprise a labeled 

compound, e.g., binding partner or antibody, having known binding 
affinity for the antigen, a source of antigen (naturally occurring 
or recombinant) and a means for separating the bound from free 
labeled compound, e.g., a so i id phase for ilranobili2irlg ^ 

3 HDTEA84 Compartments containing reagents, and instructions, will 
normally be provided. 

Antibodies, including antigen binding fragments, specific 
for, e.g., the HDTEA84 or fragments, are useful in diagnostic 
applications to detect the presence of elevated levels of HDTEA84 
and/or its fragments. Such diagnostic assays can employ lysates, 
live cells, fixed cells, immunofluorescence, cell cultures, body 
fluids, and further can involve the detection of antigens related " 
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to the antigen in serum, or the like. Diagnostic assays may be 
homogeneous (without a separation step between free reagent and 
antigen-binding partner complex) or heterogeneous (with a 
separation step) . Various commercial assays exist, such as 
5 radioimmunoassay (RIA) , enzyme-linked immunosorbent assay (ELISA) 
enzyme immunoassay (EIA) , enzyme -multiplied immunoassay technique' 
(EMIT) , substrate-labeled fluorescent immunoassay (SLFIA) , and the 
15 llkG - See ' e - g "' Van Vunakis, et al . (1980) M^h^nzymol^ 70:1- 

525; Harlow and Lane (1980) A ntibodies: A Lahnr^n ry Wanlla i CSH 
Press, NY; and Coligan. et al. (eds. 1993) Current tw^i . H 
Immunolnrry Greene and Wiley, Ny. 

Anti-idiotypic antibodies may have similar use to diagnose 
presence of antibodies against a described protein, as such may be 
diagnostic of various abnormal states. Overproduction of 
prostaglandin transporter may reflect various medical conditions, 
which may be diagnostic of abnormal physiological states, 
particularly in proliferative cell conditions such as cancer or 
abnormal differentiation. For example, leukemias and lymphomas 
may exhibit altered transporter expression, which may reflect 
their altered physiology and may provide means to selectively 
target. Alternatively, overproduction of HDTEA84 , HSLJD37R 
RANKL, HCC5, MD-1, or MD-2 may result in production of various 
immunological reactions which may be diagnostic of abnormal 
physiological states, particularly in proliferative cell 
25 conditions such as cancer or abnormal activation or 

differentiation. Expression levels of dc-PGT, Dubs, or cyclin E2 
may likewise be diagnostic of specific therapeutic conditions, 
advantageous or disadvantageous. 

Frequently, the reagents for diagnostic assays are supplied 
in kits, so as to optimize the sensitivity of the assay. For the 
subject invention, depending upon the nature of the assay, the 
protocol, and the label, either labeled or unlabeled antibody or 
binding partner, or labeled HDTEA84 is provided. This is usually 
in conjunction with other additives, such as buffers, stabilizers, 
materials necessary for signal production such as substrates for 
enzymes, and the like. Preferably, the kit will also contain 
instructions for proper use and disposal of the contents after 
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Typically the kit has compartments for each useful reagent 
Desirably, the reagents are provided as a dry lyophilized powder' 
where the reagents may be reconstituted in an aqueous medium 
providing appropriate concentrations of reagents for performing 
the assay. 

Many of the aforementioned constituents of the drug screening 
and the diagnostic assays may be used without modification or may 
be modxfied in a variety of ways. For example, labelino may be 
achieved by covalently or non-covalently joining a moiety which 
directly or indirectly provides a detectable signal. i n these 
assays, the binding partner, test compound, HDTEA84, or antibodies 
thereto can be labeled either directly or indirectly. 
Possibilities for direct labeling include label groups: 
radiolabels such as 125 If enzymes «j.s. Pat. No. 3,645,090) such 
as peroxidase and alkaline phosphatase, and fluorescent labels 
(U.S. Pat. No. 3,940,475) capable of monitoring the change in 
fluorescence intensity, wavelength shift, or fluorescence 
polarization. Possibilities for indirect labeling include 
biotinylation of one constituent followed by binding to avidin 
coupled to one of the above label groups. 

There are also numerous methods of separating the bound from 
the free polypeptide, or alternatively the bound from the free 
test compound. The polypeptide can be immobilized on various 
matrixes followed by washing. Suitable matrices include plastic 
such as an ELISA plate, filters, and beads. See, e.g., Coligan, 
et al. (eds. 1993) Current Protocols in Tn™„^i „ T| , Vol _ x 
Chapter 2, Greene and Wiley, NY. other suitable separation 
techniques include, without limitation, the fluorescein antibody 
magnetizable particle method described in Rattle, et al. (1984) 
C lAn, Chem. 30:1457-1461, and the double antibody magnetic 
particle separation as described in U.S. Pat. No. 4,659,678. 

Methods for linking proteins or their fragments to the 
various labels have been extensively reported in the literature 
and do not require detailed discussion here. Many of the 
techniques involve the use of activated carboxyl groups either 
through the use of carbodiimide or active esters to form peptide 
bonds, the formation of thioethers by reaction of a mercapto group 
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with an activated halogen such as chloroacetyl. or an activated 
olefin such as maleimide, for linkage, or the like. Fusion 
proteins will al so find use in these applications. 

Another diagnostic aspect of this invention involves use of 
olxgonucleotide or polynucleotide sequences taken from the 
sequence of a described protein. These sequences can be used as 
probes for detecting levels of the message in samples from 
patients suspected of having an abnormal condition, e.g cancer 
or developmental problem, since the antigen is a marker for 
activation, it may be useful to determine the numbers of activated 
T cells to determine, e.g., when additional suppression may be 
20 Called for - The Preparation of both RNA and DNA nucleotide 

sequences, the labeling of the sequences, and the preferred size 
of the sequences has received ample description and discussion in 
15 the literature. See, e.g., Langer-Saf er, et al . (1982) Proc . 
25 Nat'l. Acad. Sri 79:4381-4385; Caskey (1987) Science 236:962-967; 

and Wilchek, et al . (1988) Anal. Binrh^ 171.-1-32. 

Alternatively, antibodies may be employed which can recognize 
specific duplexes, including DNA duplexes, RNA duplexes, DNA-RNA 
hybrid duplexes, or DNA-protein duplexes. The antibodies in turn 
may be labeled and the assay carried out where the duplex is bound 
to a surface, so that upon the formation of duplex on the surface 
the presence of antibody bound to the duplex can be detected. The 
use of probes to the novel anti-sense RNA may be carried out in 
any conventional techniques such as nucleic acid hybridization, 
plus and minus screening, recombinational probing, hybrid released 
translation (HRT) , and hybrid arrested translation (HART) . This 
also includes amplification techniques such as polymerase chain 
reaction (PCR) . 

Diagnostic kits which also test for the qualitative or 
quantitative presence of other markers are also contemplated. 
45 Diagnosis or prognosis may depend on the combination of multiple 

indications used as markers. Thus, kits may test for combinations 
of markers. See. e.g., Viallet, et al. (1989) Proore^ Sn a™.^ 
35 Factor Res , 1:89-97. Other kits may be used to evaluate T cell 
50 subsets . 
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X. Methods for Isolating Substrates/Specific Partners 

The DC-PGT should interact with its substrate target The 
substrate will be similar to the organic molecules which are 
subject to transport. The Dubs and cyclin E2 will also be 
screened for substrate identification. 

The HDTEA84, HSUD37R, and RANKL protein should interact with 
a TNF Ixgand, based, e.g., upon its similarity in structure and 
function to other cell surface antigens exhibiting similar 
structure and cell type specificity of expression. The MD-1 and 
MD-2 antigens are related to known proteins, which interact with B 
cell antigens. Methods to isolate a ligand are made available by 
the ability to make purified protein for screening programs 
Similar techniques will be applicable to the HCC5 chemokine, and 
the MD-1 and MD-2 surface receptors. 

Sequences provided herein will allow for screening or 
isolation of specific ligands. Many methods exist for expression 
cloning, panning, affinity isolation, or other means to identify a 
Ixgand. A two-hybrid selection system may also be applied making 
approprxate constructs with the available sequences, as 
appropriate. See, e.g.. Fields and Song (1989) Nature 340:245- 
246. 

The broad scope of this invention is best understood with 
reference to the following examples, which are not intended to 
limit the invention to specific embodiments. 



SUBSTITUTE SHEET ( rule 26 ) 



10 



15 



0 



WO 00/01817 PCT/US99/12366 

89 

EXAMPLES 

General Methods 

Some of the standard methods are described or referenced, 
e.g., inManiatis, et al. (1982) Molecular cloning, * r-aw^. 
Manual, cold Spring Harbor Laboratory. Cold Spring Harbor Press; 
Sambrook. et al. (1989) Mol ecular Honing: A r. a w a , fcOTV Mama1 ' {2 d 
ed.), vols. 1-3, CSH Press, ny ; AttBubal . et al ., j^^, Qreene 
Publishing Associates, Brooklyn, NY,,- or Ausubel, et al. (1987 and 
Supplements) Current Protocols in Molecu lar Biology Gree ne and 
Wiley, New York; Innis, et al . (eds. 1990) PGR Proton- » 
to Methods and App lications, Academic Press, N.Y. Methods for 
protein purification include such methods as ammonium sulfate 
precipitation, column chromatography, electrophoresis, 
centrifugation, crystallization, and others . See, e.g., Ausubel, 
et al. (1987 and periodic supplements); Deutscher (1990) "Guide to 
Protein Purification" in Methods in Rn^i vo l. la2 , and other 
volumes in this series; and manufacturer's literature on use of 
protein purification products, e.g., Pharmacia, Piscataway, N.J. , 
or Bio-Rad, Richmond, CA. Combination with recombinant techniques 
allow fusion to appropriate segments, e.g., to a FLAG sequence or 
an equivalent which can be fused via a protease-removable 
sequence. See, e.g.. Hochuli (1990) 'Purification of Recombinant 
Proteins with Metal Chelate Absorbent" in Setlow (ed.) Genetic 
Engineering, Principle and Mpi-h^c 12:87-98, Plenum Press, N.Y.; 
and Crowe, et al . (1992) gJA express : The High Expression I 

Protein Purification , S y ptem QIAGEN, Inc., Chatsworth, CA. Cell 
culture techniques are described in Doyle, et al. (eds. 1994) Cell 
a nd Tissue Culture; T la borator y Pr ocedure John Wiley and Sons, 
NY. 

Standard immunological techniques are described, e.g., in 
Hertzenberg, et al. (eds. 1996) Weir's Windhoek of p^^-L^ 
Immunol oqy vols. 1-4, Blackwell Science; Coligan (1991) Current 
Protocols in Immunol op y. Wiley /Greene, NY; and Methods in 
EnzvmQloqy volumes. 70, 73, 74, 84, 92, 93, 108, 116, 121, 132, 
150, 162, and 163. 
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FACS analyses are described in Melamed, et al . (19 90 ) F i ow 
Cytometry .Sorrina Wiley-Liss, Inc.. New York, NY; Shapiro 
(1988) Practical How PyrmnPfry Liss, New York, NY; and Robinson, 
et (1 " 3) Handb °<* of Flow rv^^ w^gy-Mss, New' 

York, NY. Fluorescent labeling of appropriate reagents was 
performed by standard methods. 

The FASTA (Pearson and Lipman, 1988) and BLAST (Altschul et 
al. (1990) J. Mol. B ioi^ 215:403-410) programs were used to comb 
nonredundant protein and nucleotide databases (Benson, et al 
(1994) Nucl. Aei< , s K ^ 22:3441-3444; Bairoch and Boeckmann (1994) 
Nucl. Acids Re, 22:3578-3580, with the resultant cDNA and encoded 
protein sequences. The sensitive search strategies of Altschul 
€t al - (1994) Nature G enets 6:119-129; and Koonin. et al (1994,' 
Em^L. 13:493-503; served as examples of how to locate distant 
structural homologues of protein chains. Multiple alignments of 
collected homologues were carried out with ClustalW (Thompson, et 
al - (1 " 4) Comp - A PP T ^- Bio pri 10.-19-29) and MACAW (Schuler, et 
al. (1991) Proteins 9:180-190). 

The membrane topologies of proteins, e.g., DC-PGT, and a 
cohort of putative homologues were analyzed by a variety of 
methods that sought to determine the consensus number of domains, 
e.g., hydrophobic membrane-spanning helices and the likely 
cytoplasmic or extracellular exposure of the hydrophilic 
connecting loops. For single sequence analysis, the ALOM and MTOP 
(Klein, et al. (1985) Biochim. *i np y 815:468-476; and 

Hartmann, et al. (1989) Prop, Nat.T . flrad. Sri rrea 86:5786-5790) 
programs were accessed from the PSORT World-Wide Web site (Nakai 
and Kanehisa (1991) Proteins 11:95-110; and Nakai and Kanehisa 
(1992) Senomics 14:897-911); in turn, the TopPredH program 
(Claros and von Heijne £1994) Comp. Ann! ,V 10:685-686; 
Maclntosh PPC version) was used to parse chains into probable ' 
hydrophobic transmembrane and loop regions of DC-PGT, and further 
predict the localization of these latter regions by prevalence 
charged residue types (von Heijne (1992) J. Mol. mm 225:487- 
494; and Sippos and von Heijne (1993) Eur. J. Binrh T 213:1333- 
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1340). MEMSAT (Jones, et al . (1994) Biochgm . 33:3038-3049; MS-DO; 
PC version) was likewise used to fit individual sequences into 
statistically-based topology models that render judgment on 
membrane spanning and loop chain segments. Two Web-accessible 
programs that are able to make use of evolutionary data by 
analyzing multiply aligned sequences are PHD (Rost, et al. (1994, 
Comp, Applir Riosri 10:53-60; and Rost. et al . (1995) PrntPin 
4:521-533) and TMAP (Persson and Argos (1994) J. MoT Ri^ 
237:182-192); the former utilizes a neural network system to 
accurately predict the shared location of helical transmembrane 
segments in a protein family. similar analysis of other proteins 
can be performed. 

I. Generation of Dendritic Cells 

Human CD34+ cells are obtained as follows. See, e.g., Caux, 
et al. (1995) pages 1-5 in Banchereau and Schmitt Dendrite rv>n I 
i n FundamPntal and Clinical t,,^!^ Plenum Press, NY. 
Peripheral or cord blood cells, sometimes CD34+ selected, are 
cultured in the presence of Stem Cell Factor (SCF) , GM-CSF, and 
TNF-a in endotoxin free RPHI 1640 medium (GIBCO, Grand Island, NY) 
supplemented with 10% (v/v) heat -inactivated fetal bovine serum 
(FBS; Flow Laboratories, Irvine, CA) , 10 mM HEPES , 2 mM L- 
glutamine, 5 x 10"5 M 2-mercaptoethanol, penicillin (100 ug/ml) . 
This is referred to as complete medium. 

CD34+ cells are seeded for expansion in 25 to 75 cir.2 flasks 
(Corning, NY) at 2 x 10* cells/ml. Optimal conditions are 
maintained by splitting these cultures at day 5 and 10 with medium 
containing fresh GM-CSF and TNF-a (cell concentration: 1-3 x 105 
cells/ml) . In certain cases, cells are FACS sorted for CDla 
expression at about day 6. 

In certain situations, cells are routinely collected after 12 
days of culture, eventually adherent cells are recovered using a 5 
mM EDTA solution. In other situations, the CDla* cells are 
activated by resuspension in complete medium at 5 x 10 6 cells/ml 
and activated for the appropriate time (e.g., 1 or 6 h) with 1 
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Ug/ml phorbol 12-myristate 13-acetate (pma, signta) and 100 ng/inl 
ionomycin (Calbiochem, La Jolla, CA) . These cells are expanded 
for another 6 days, and rna isolated for cDNA library preparation 
Other specific cell types may be similarly isolated. 

II. RNA Isolation and Library Construction 

Total RNA is isolated using, e.g., the guanidine 
thiocyanate/CsCl gradient procedure as described by Chirgwin, et 
al. (1978) BiocheTn . 18:5294-5299. 

Alternatively, poly (A) + RNA is isolated using the OLIGOTEX 
ihpna xsolation kit (QIAGEN) . Double stranded cDNA are generated 
using, e.g., the SUPERSCRIPT plasmid system (Gibco BRL, 
Gaithersburg, ND; for cDNA synthesis and plasmid cloning. The 
resulting double stranded cDNA is unidirectionally cloned, e g 
into psportl and transfected by electroporation into ELECTROMAX ' 
DH10BTM Cells (Gibco BRL, Gaithersburg, MD) . 
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III . Sequencing 

DNA isolated from randomly picked clones, or after 
subtractive hybridization using inactivated cells, are subjected 
to nucleotide sequence analysis using standard techniques. 
Alternatively, selected isolated clones can be selected. A Tag 
DiDeoxy Terminator cycle sequencing kit (Applied Biosystems 
Foster City, CA) can be used. The labeled DNA fragments are 
separated using a DNA sequencing gel of an appropriate automated 
sequencer. Alternatively, the isolated clone is sequenced as 
described, e.g., in Maniatis, et al. (Current ed.) Molecular 
Cloning, A Laboratory Manna] , Cold Spring Harbor Laboratory, Cold 
Spring Harbor Press; Sambrook, et al. (Current ed. ) MaLecular 
30 cl< ?"inq; ft Laboratory Manual, (2ded.), vols. 1-3, CSH Press, NY,- 
Ausubel, et al., Biology., Greene Publishing Associates, Brooklyn,' 
NY,- or Ausubel, et al. (Current ed. , and Supplements) Current 
Protocols i n Molecular Biology , Greene/Wiley, New York. Chemical 
sequencing methods are also available, e.g., using Maxim and 
50 35 Gilbert sequencing techniques. 
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IV. Recombinant gene constructs 

Poly (A) + RNA is isolated from appropriate cell populations 
e.g.. using the FastTrack mRNA kit (Invitrogen, San Diego, CA, 

Samples are electrophoresed, e a in a i* , ^ . . 

' ln a -L* agarose gel containing 

5 formaldehyde and transferred to a GeneScreen membrane (MEN 

Research Products, Boston, MA, . Hybridization is performed e g 
at 65- C in 0.5 M NaHP0 4 pH 7.2, 7% SDS, 1 mM EDTA, and 1% BSA " 
(fraction V) with 32 P -dCTP labeled ^ ^ ^ ^ ^ ^ 
After hybridization, filters are washed three times at 50 • C in 
10 0.2X SSC, 0.1% SDS, e.g., for 30 min. and exposed to fi!m for 24 
h. A positive signal will typically be 2X over background 
preferably 5-25X. 

The recombinant gene construct may be used to generate a 
probe for detecting the message. The insert may be excised and 
used m the detection methods described above. Various standard 
methods for cross species hybridization and washes are well known 
ln the art " See ' e -S-' Sambrook, et al. and Ausubel . 

V: Gene Cloning 

The HDTEA84 was assembled by careful analysis of ESTs found 
in various databases. These ESTs were from cDNA libraries derived 
from Hodgkin-s lymphoma, endothelial cells, keratinocytes 
prostate, and cerebellum. PC R primers are designed and 
synthesized and a PCR product is obtained from any of these 
libraries. This product is used as a hybridization clone to 

screen these libraries for a fun i n , . 

ror a tull length clone, which may include 

a transmembrane segment. 

Likewise, the HSLJD37R was identified from sequences derived 
from CDNA libraries from: smooth muscle, pancreas tumor, 
adipocytes, HUVEC cells, adult pulmonary, endothelial cells 
prostate cell line PC3, microvascular endothelial cells, fetal 
heart, and dendritic cells, a GenBank report by Pan, et al. has 
been submitted. See GenBank Accession 3549263 . Other sequences 
were detected in libraries from: multiple sclerosis lesions 
breast, kidney, and germinal center B cells. rt-PCT showed' signal 
m B cells, pel, granulocytes, T cells, monocytes, dendritic cell 
subpopulations including PMA/ionomycin treated, U937 cells, jy 
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cells, MRC5 cells. CHA, Jurkat, and y C l cells. This suggests that: 
the transcript is widely expressed. 
10 . RAHKL WaS als ° iden tified in cDNA libraries from specific 

tissues, as described. Likewise, the HCC5 chemokine sequence W as 
5 identify. The Dubll and Dubl2 genes were identified, in part 
fro™ their similarity to known Dubl and Dub2 genes. The MD-1 and 
MD-2 were identified. ia part , from their similarity tQ ^ 

f ° r thS RP105 gene - ™ e <*clin E2 was identified based upon its 
similarity to cyclin E. 
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VI. Expression Profile 

To examine DC-PGT mRNA expression standard Northern Blot 
Analysis using a RT-PCR fragment of DC-PGT were carried out 
against human tissue, e.g.. Northern blots containing 
approximately 10 to 20 jtg of total RNA are run in formaldehyde 
gels and transferred to Nytran membranes (Schleicher & Schuell 
Keene, NH> by standard methods, and blots were hybridized with'a 
labeled PCR fragment of DC-PGT and washed at 65" C. cDNA can be 
isolated from cells, embryonic tissues, and adult organs using 
RNAzol solution (Tel-test, Inc., Friendswood, TX) according to 
manufacturer's instructions. Large amounts of plasmid DNA 
containing differential display pcr products are prepared using 
the QIAGEN Plasmid Maxi Kit (QIAGEN) following the manufacturer's 
instructions. Plasmid DNA is cu t with EcoRl (Boehringer Mannheim, 
or BstXI (KE Biolabs, Mass.), gel extracted with the QIA3X gel 
extraction kit (QIAGEN) and random primed with [32 P]dCTP 
(Amersham) using the Prime-It II kit (Stratagene, La Jolla, CA) 
all m accordance with manufacturer's instructions. Various 
primers may be used to guantitate expression of message. Means to 
block DNA hybridization signal, or RNA isolation, will be 
applicable to guantitate roughly the amount of expression of 
45 appropriate RNAs . 

The results revealed mRNA of one band at approximately 9 0 
kB, another band at approximately 3.0 kB, and a 4.4 kB size which 
is consistent with the size predicted for the SEQ ID NO: 1 nucleic 
acid. The smaller mRNA product band could be an alternatively 
spliced form of SEQ ID NO: 1. DC - PG T is highly expressed in both 
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activated and non-activated dendritic cells (DC) , activated 
monocytes, activated granulocytes and adult lung. No expression 
was found in T or PEL cells (either activated or non-activated) 
Minor expression was detected in B cell (both activated and non- 
activated) and limited expression was detected in the brain. The 
results of the northern analysis suggests an expression in 
macrophages, rather than monocytes (Kuppfer cells in the liver 
microglial cells i n the brain, alveolar macrophages in the lung, 
particularly as there is no expression in PEL. Southern 
expression analysis carried out using common techniques confirmed 
the expression pattern revealed in the Northern analysis. 

For example, the DC-PGT tissue distribution seems to have 
highest mRNA levels in kidney, placenta, liver, bone marrow, 
thymus, spleen, lung, and some in testis. This distribution 
corresponds to organs with especially important ion exchange 
features, e.g., Na , K , or Ca. or in hematopoietic organs. 
Generally, the expression is higher in fibroblast and 
hematopoietic cells compared to neuronal cells. 

A probe specific for cDNA encoding the HDTEA84 , HSLJD37R or 
RANKL is used to determine tissue distribution of message encoding 
the antigen. Standard hybridization probes may be used to do a 
Northern analysis of RNA from appropriate sources, either cells, 
e.g.. stimulated, or in various physiological states, in various 
tissues, e.g., spleen, liver, thymus, lung, etc., or in various 
species. Southern analysis of cDNA libraries may also provide 
valuable distribution information. Standard tissue blots or 
species blots are commercially available. Similar techniques will 
be useful for evaluating diagnostic or medical conditions which 
may correlate with expression in various cell types. 

PCR analysis using appropriate primers may also be used. 
Antibody analysis, including immunohistochemistry or PACS, may be 
used to determine cellular or tissue distribution. 

Southern blot analysis of primate cDNA libraries is performed 
on, e.g.,: U937 premonocytic line, resting (M100); elutriated 
monocytes, activated with LPS, IFNy, anti-lL-10 for 4, 16 h pooled 
(M106); elutriated monocytes, activated with LPS, IFNy, il-10 for 
4, 16 h pooled (M107); elutriated monocytes, activated LPS for 1 h 
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(M108); elutriated monocytes, activated LPS for 6 h (M109) • 
dendritic cells (DC, 30% CDla + , from CD34 + GM-CSF. TNFoc ^days, 
resting; DC 70% CDl a+ , f rom CD34 + GM-CSF, TNFOC 12 days, resting 
(D101); DC 70% CDla + , f rom C D34 + GM-CSF , TNFCC 12 days, activated 
with PMA and ionomycin for 1 hr (D102); DC 70% CDla + , from CD34 + 
GM-CSF, TNFa i 2 days, activated with PMA and ionomycin for 6 hr 
(D103); DC 95% CDla + , from CD 3. 4+ gm-csf , TNFa 12 days activated 

GM-CSF, IL-4 5 days, resting (D107, ; dc from monocytes GM-CSF il 
4 5 days, resting (D108) ; DC from monocytes GM-CSF. IL-4 5 days 
activated TNFa, monocyte supe for 4, 16 h pooled (olio,,- EBV 
transfected B cell lines, resting; spleenocytes . resting- 
spleenocytes, activated with PMA and ionomycin; 20 NK clones 
resting, pooled; 20 MK clones activated with PMA and ionomycin, 
15 pooled; NKL clone, il-2 treated; NK cytotoxic clone, resting- 
ad.pose tissue fetal 28 wk male (O108, ; brain fetal 28 wk male 
CO104); gallbladder fetal 28 wk male (O106,,- heart fetal 28 wk 
-le (O103); small intestine fetal 28 wk male (0107,; kidney fetal 
28 wk male (O100) ; l iver fetal 28 wk ^ [Q1Q2) ; ^ ^ ^ ^ 

e (Old,,- ovary f eta l 25 wk female (O i 09); adult placenta 28 wk 
(0113); spleen fetal 28 wk male (0112); testes fetal 28 wk male 
(0111); uterus fetal 25 wk female (O110) ; TH0 clone Mot 72 
resting (T102); T cell, TH0 clone Mot 72, activated with anti-CD28 
and anti-CD3 for 3, 6, 12 h pooled (T103) ; T cell, TH0 clone Mot 
72. anergic treated with specific peptide for 2, 7, 12 h pooled 
(T104); ThO subtraction of resting from activated; T cell TH1 
clone HY06, resting (T107) ; t cell, TH1 clone HY06, activated with 
an tl -CD28 and anti- C D3 for 3, 6, 12 h pooled (T108) ; T cell THl 
clone HY06, anergic treated with specific peptide for 2 , 6 12 h 
pooled (T109),- Thl subtraction of resting from activated; T cell 
TH2 clone HY935, resting (T110); T cell, TH2 clone HY935 
activated with anti-CD28 and anti-CD3 for 2, 7, 12 h pooled 
(Till); and Th2 subtraction of resting from activated. 

Samples for mouse mRNA distribution may include, e g ■ 
resting mouse fibroblastic L cell line (C200) ; Braf:ER (Braf 
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fusion Co estrogen receptor) transfected cells, control (C201) • 
cells, TH1 polarized (Mell4 bright, CD4 + cells from spleen, 
polarized for 7 days with iFN-y and anti IL-4; T200) , T cells, 
polarized (Mell4 bright, CD4+- cells from spleen, polarized for 7 
days with IL-4 and anti-IFN-y,- T201>; T cells, highly TH1 polarized 
(see Openshaw, et al . (1995) J . Exp. Med 182:1357-1367; activated 
with anti-CD3 for 2, 6, 16 h pooled; T202); T cells, highly TH2 
polarized (see Openshaw, et al . (1995) a. Em 182:1357-1367- 
activated with anti-CD3 for 2, 6. 16 h pooled; T203); CD44- CD25 + 
pre T cells, sorted from thymus (T204) ; TH1 T cell clone Dl 1 
resting for 3 weeks after last stimulation with antigen (T205); 
TH1 T cell clone Cl.l, 10 ug/ml ConA stimulated 15 h (T206) ,- TH2 T 
cell clone CDC35, resting for 3 weeks after last stimulation with 
antigen (T207,,- TH2 T cell clone CDC35 , 10 ug/ m l ConA stimulated 
15 h (T208); Mei 14+ naive T cells from spleen, resting (T209); 
Mell4+ T cells, polarized to Thl with IFN-y/IL-12/anti-lL-4 for 6, 
12, 24 h pooled (T210); Mei 14+ T cells, polarized to Th2 with IL- 
4/anti-IFN-y for 6, 13, 24 h pooled (T211) ; unstimulated mature B 
cell leukemia cell line A20 (B200); unstimulated B cell line CH12 
(B201); unstimulated large 3 cells from spleen (B202) ,- B cells 
from total spleen, LPS activated (B203) ; metrizamide enriched 
dendritic cells from spleen, resting (D200); dendritic cells from 
bone marrow, resting (D201) ; monocyte cell line RAW 264 7 
activated with LPS 4 h (M200); bone-marrow macrophages derived 
wxth GM and M-CSF (M201) ; macrophage cell line J774, resting 
(M202); macrophage cell line J774 + LPS + anti-IL-10 at 0.5. 1, 3, 
6, 12 h pooled (M203); macrophage cell line J774 + LPS + IL-lOat' 
0.5, 1, 3, 5, 12 h pooled (M204) , aerosol challenged mouse lung 
tissue, Th2 primers, aerosol OVA challenge 7, 14, 23 h pooled (see 
Garlisi, et al. (1995) Clinical Trnmunoloov anrt Tmmnno™^ ngy 
75:75-83; X206) ; Nippostrongulus-inf ected lung tissue (see 
Coffman, et al. (1989) Science 245:308-310; X200) ; total adult 
lung, normal (O200) ; total lung, rag-1 (see Schwarz, et al . (1993) 
TffTOWMtififiriPnry 4:249-252; O205) ,- IL-10 K.O. spleen (see Kuhn, et 
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«1- 11991) Cell 75:263-274; X201); total adult spleen, normal 

(0201) ; total spleen, rag-1 (O207) ; il-10 k.O. Peyer's oatches 

(0202) ; total Peyer-s patches, normal (0210) ; IL-10 K O 
mesenteric lymph nodes (X203) ; total mesenteric lymph'nodes 
normal (0211); IL-10 K.O. colon <X203); total colon, normal' 
(0212); NOD mouse pancreas (see Makino, et al. (1980) Jikk^n 

fiSbutsu 29:1-13; X205); total thymus, rag-1 (O208) , total kidney 
rag-1 (O209); total heart, rag-1 (O202); total brain, rag-1 

(0203) ; total testes, rag-1 (O204) ,- total liver, rag-1 (O206) ■ rat 
normal joint tissue (0300) ; and rat arthritic joint tissue (X300) 

A. Direct protein detection by antibodies 
Various cells, tissues, and developmental stages are stained 

with labeled antibodies. The detection may be immuno- 
histochemical for solid tissue, by FACS in disperse cells,- and bv 
other appropriate methods for other sample types. Antibodies " 
specific for the various forms may be used to distinguish between 
membrane associated and soluble fragments. Various amplification 
means may be coupled to increase sensitivity. 

B. Functional detection 

0 Specific neutralizing antibodies should provide means to 

specifically block the biological activity of the prostaglandin 
transporter. Activities related to prostaglandin bindina, or to 
prostaglandin transport may be measured by sensitive means based 
upon knowledge of the normal biological function of the various 

5 forms . 

Further testing of populations of cells, e.g., hematopoietic 
progenitors, or of other cell or tissue types will be useful to 
further determine distribution and likely function, other tissue 
types, at defined developmental stages, and pathology samples may 
) be screened to determine whether pathological states or stages may 
be advantageously correlated with the biological activity of the 
transporter. 



VTI. Protein Expression 

PCR is used to make a construct comprising the open reading 
frame, preferably in operable association with proper promoter, 
selection, and regulatory sequences. The resulting expression 
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plasmid is transformed into an appropriate cell type, e g the 
ToppS, E. coli strain (Stratagene, La Jolla, CA> . Ampicillin 
resistant (50 Hg/ml) transformants are grown in Luria Broth 
(Gibco) at 37- C until the optical density at 550 nm is 0.7. 
Recombinant protein is induced with 0.4 mM 

isopropyl-PD-thiogalacto-pyranoside (Sigma, St. Louis, MO) and 
incubation of the cells continued at 20" C for a further 18 hours 
Cells from a 1 liter culture are harvested by centrifugation and 
resuspended, e.g., in 200 ml of ice cold 30% sucrose, 50 mM Tris 
HC1 P H 8.0, 1 mM ethylenediaminetetraacetic acid. After 10 min on 
ice, xce cold water is added to a total volume of 2 li ters After 
20 mm on ice. cells are removed by centrifugation and the 
supernatant is clarified by filtration via a 5 UM Mill ipak 60 
(Millapore Corp., Bedford, MA), 

The recombinant protein is purified via standard purification 
methods, e.g., various ion exchange chromatography methods 
Immunoaffinity methods using antibodies described below can also 
be used. Affinity me-.hods may be used where an epitope tag is 
engineered into an expression construct. 

Similar methods are used to prepare expression constructs and 
cells m eukaryotic cells. Eukaryotic promoters and expression 
vectors may be produced, as described above. 

Further study of the expression and control of prostaglandin 
transporter will be pursued. The controlling elements associated 
with the antigens may exhibit differential developmental, tissue 
specific, or other expression patterns. Upstream or downstream 
genetic regions, e.g., control elements, are of interest. 

Multiple transfected cell lines are screened for one which 
expresses the antigen, membrane bound, or soluble forms, at a high 
level compared with other cells. Various cell lines are screened 
and selected for their favorable properties in handling. Natural 
protein can be isolated from natural sources, or by expression 
from a transformed cell using an appropriate expression vector 
Purification of the expressed protein is achieved by standard 
procedures, or may be combined with engineered means for effective 
purification at high efficiency from cell lysates or supernatant* 
FLAG or His 6 segments can be used for such purification features. 
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VIII. Protein Purification 

10 ThS P^^l-ndin transporter is isolated by a combination of 

affmxty chromatography using the prostaglandin transporter 
5 specific binding compositions, e.g., antibody, as a specific 
binding reagent in combination with protein purification 
techniques allowing separation from other proteins and 
15 contaminants. Various detergent combinations may be tested to 

determine what combinations will retain biological activity while 
solubilizing contaminants. The purification may follow biological 
activity, e.g., prostaglandin binding or transport into membranes 
or by ELISA or other structural binding reagents . 

Similar methods are applied for purification of other 
polypeptides . 



10 

20 



15 



30 20 



25 IX - Isolation of Homologous Genes 

The described genes, e.g., cDNA, can be used as a 
hybridization probe to screen a library from a desired source 
e.g., a primate cell cDNA library. Many different species can be 
screened both for stringency necessary for easy hybridization, and 
for presence using a probe. Appropriate hybridization conditions 
will be used to select for clones exhibiting specificity of cross 
hybridization. 

35 CDNA li^aries from the desired species are collected, from 

« appropriate cell types. Screening by hybridization or PGR using 

degenerate probes based upon the peptide sequences will also allow 
isolation of appropriate clones. Alternatively, use of 
appropriate primers for PGR screening will yield enrichment of 
appropriate nucleic acid clones. 

Similar methods are applicable to isolate either species 
polymorphic, or allelic variants. Species variants axe isolated 
45 USin9 cross - s P ec ies hybridization techniques based upon isolation 

of a full length isolate or fragment from one species as a probe 

Alternatively, antibodies raised against proteins will be 
used to screen for cells which express cross-reactive proteins 
from an appropriate, e.g., cDNA library. The purified protein or 
defined peptides are useful for generating antibodies by standard 
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methods, as described above. Synthetic peptides or purified 
protein are presented to an immune system to generate monoclonal 
or polyclonal antibodies. See, e.g., Coligan (1991) Current. 
Protocol p in Immunology Wiley /Greene; and Harlow and Lane (1989) 
Antibodies: A r^hnral-ory W nmn- Cold Spring Harbor Press. The 
resulting antibodies are used, e.g., for screening, panning, or 
sorting. 



10 



X. Antibody Preparation 

Synthetic peptides or purified protein, natural or 
recombinant, are presented to an immune system to generate 
monoclonal or polyclonal antibodies. See, e.g.. Coligan (1991) 
Current Protocols in Immu n ol o gy Wiley /Greene; and Harlow and 

^ (1989) antibodies: ft Lahnr^nry Ma „„.i Cold Spring Harbor 

15 Press. Polyclonal serum, or hybridomas may be prepared. In 

appropriate situations, the binding reagent is either labeled as 
described above, e.g., fluorescence or otherwise, or immobilized 
to a substrate for panning methods. 

0 XI. Chromosome Mapping 

DNA isolation, restriction enzyme digestion, agarose gel 
electrophoresis. Southern blot transfer and hybridization are 
performed according to standard techniques. See Jenkins, et al 
(1982) J, Virol , . . 43:26-36. Blots may be prepared with Hybond-N 
nylon membrane (Amersham) . The probe is labeled with 32 p _ dCTP; 
washing is done to a final stringency, e.g., of 0.1X SSC. 0.1%' 
SDS, 65* C. 

Alternatively, a BIOS Laboratories (New Haven, CT) mouse 
somatic cell hybrid panel may be combined with PCR methods. See 
Fan, et al. (1996) Immunngpnpfir g 44:97-103. 

Chromosome spreads are prepared. In situ hybridization is 
performed on chromosome preparations obtained from 
phytohemagglutinin-stimulated human lymphocytes cultured for 72 h. 
5-bromodeoxyuridine is added for the final seven hours of culture 
(60 ng/ml of medium) , to ensure a posthybridization chromosomal 
banding of good quality. 



5 
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A PGR fragment, amplified with the help of primers, is cloned 
into an appropriate vector. The vector is labeled by nick- 
translation with 3 H . The radiolabeled probe . s hybrid . zed fco 
metaphase spreads at final concentration of 200 ng/ml of 
hybridization solution as described in Mattel, et al. (l 98 5) Hum. 
Genet. 69:327-331. 

After coating with nuclear track emulsion (KODAK NTB 2 ) 
slides are exposed. To avoid any slipping of silver grains during 
the banding procedure, chromosome spreads are first stained with 
buffered Giemsa solution and metaphase photographed. R-banding is 
then performed by the f luorochrome-photolysis-Giemsa (FPG) method 
and metaphases rephotographed before analysis. 

Using these techniques, the DC-PGT gene was mapped to marker 
SHGC-3911 on chromosome llqlB with a resulting lod score of 
15 1000.0. Other markers in the SHGC-3911 region at chromosome ll g i 3 
include the Fccri receptor which is alleged to be associated with 
allergic conditions. m comparison to the location of DC-PGT the 
ubiquitously expressed human PGT homologue of Lu et al ., (described 
above) is localized to chromosome 7. 



10 



0 



XII. Biochemical Characterization 

Constructs for the expression of. e.g., DC -p G T are made with 
a tag (FLAG) sequence (Hopp, et al . (1988) Biotech™ ! ^ (Ny) 
6:1205-1210) introduced in the protein. The open reading frame of 
the DC-PGT cDNA of SEQ ID NO: 1 is amplified by appropriate PCR 
primers using standard methods to introduce the FLAG peptide 
sequence (IB1, Hew Haven, CT) at the C- terminus of the protein 
For example, a PFU enzyme (Stratagene) with 12 cycles PCR: 94" C 
30 sec; 55" C 1 min ; 72*0 4 min. PCR constructs are cloned into 
a PME18X vector (DNAX) using Xhor and Xbal sites incorporated into 
the 5' and 3' primers, respectively. 

COS-7 cells are maintained in DMEM, 10% FCS. 4 mM L-glutamine 
(JRH Biosciences, Lenexa, KS) , 100 U/ml penicillin, and 100 ug/ml 
streptomycin. Plasmid DNA is transfected by electroporation 
CBIORAD, Hercules, CA) (20 (lg / 1 x 1 0 7 ce ll s , ^ plated intQ 
tissue culture dishes. The medium is replaced after 24 hours and 
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cell lysates and media are collected three days after 
transaction. Lysis buffer (25 mM Hepes pH 7.5, 2 mM EDTA 10% 
NP-40, 150 mMNaCl, 0.01% Aprotinin (Sigma, St. Louis, MO) 0 01% 
Leupeptin (Sigma)) is added to the plates. Plates are kept on 
ice for 45 minutes. Lysates are centrifuged for 15 minutes to 
eliminate cell debris. Supematants of centrifuged cell lysates 
and sterile-filtered media from cultured cells are incubated with 
anti-FLAG M2 Affinity Gel (IBI) at 4 • C overnight and washed four 
times with PBS. Immunoprecipitates are eluted in a Econocolumn 
(BIORAD) with 2.5 M Glycine. pH 2.5. Eluates are neutralized with 
Hepes, P H 7.4 (jrh Biosciences) and concentrated by precipi cation 
with 24% TCA and 2% deoxycholic sodium salt (Sigma) . Pellets are 
eluted in 2 x Sample Buffer (NOVEX, San Diego, CA) , 

electrophoresed on 4-20% tris-glycine gels (Novex)'and transferred 
to PVDF membranes < Immobilon-P, Millipore Corporation, Bedford, 
MA). Membranes are exposed to 3% non-fat milk for 1 h at 37* C 
Anti-FLAG M2 antibody is used as recommended (IBI) . Anti-mouse Ig 
horseradish peroxidase conjugate (Amersham) is used at 1:2,000 
dilution and the peroxidase detection is performed with ECL 
30 20 detection reagents (Amersham) . 

Other fusion proteins can be produced, e.g., a recombinant 
prostaglandin transporter construct is prepared, e.g., as a fusion 
product with a useful affinity reagent, e.g., FLAG peptide. This 
peptide segment may be useful for purifying the expression product 
25 of the construct. See, e.g., Crowe, et al . (1992) OIAexnre^ ■ ^ 
High Level Expression ft P rotein PuH f icaUnn fiy ^.. QUIAGEN, inc. 
Chatsworth, CA; and Hopp, et al. (1988) Bio/Technm^ry 6:1204- 
«> 1210 • Manbranes comprising the transporter are assayed to 

determine the natural prostaglandin substrate. Most likely the 
prostaglandin will be a uracil related prostaglandin, but may also 
include, at various levels of efficiency of binding or transport, 
pyrimidine or purine analogs. See, e.g., Goodman and Oilman 
(Current ed. ) , The Pharmacologi c Basis o f Ther^„,-i ^ ■ Lukovics 
and Zablocka Nucleoside Synthesis: OroanosiTi^ M 0 .y^ EUis 
Horwood. N.Y.; Townsend, Chemistry of Muc1^ g ^ 00 and w , lf o^n„~, 
vols. 1-3. Plenum Press, n.Y. ; Munch-Pertson (1983) Metabolic of 
Nucleotides, Nu cle osides, and Prostagland i ns in M^r 0 0rnani STnfi 
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Academic Press. ny ; Gehrke (1990) g^^^ „ Mn dif<rttM>w| w<t 

NUCl6QS id<aS V ° 1S - A ' B ' «* C. Elsevier; Bloch ( 1975 ) Shsmis ^ 
B i o l ogy, ft Hin i cal n^s of N „^ g ^ Q ap ,-,^„ Annala ^ Acad 

Sci.; andUlbricht (1964) Purines. Pvri^i nPB , . m.,^^- 
Franklin Co. 



10 

20 



25 



30 20 



/5 XI11 - Expression Cloning; Partner Screening 

A. Antibodies and f low-cytometric sorting 
Expression cloning of cells transformed with an appropriate 

cDNA ixbrary may be sorted by FACS using antibody reaoents 
described above. The sorted cells are isolated and expanded, and 
subjected to multiple selection cycles, leading to a high 
proportion of cells expressing the desired DNA. 

B. Antibodies and staining 
15 The antibodies to, e.g., DC-PGT, are used for screening of a 

library made from a cell line which expresses the polypeptide. 
Standard staining techniques are used to detect or sort 
intracellular or surface expressed ligand, or surface expressing 
transformed cells are screened by panning. Screening of 
intracellular expression is performed by various staining or 
immunofluorescence procedures. See also McMahan, et al. (1991) 
EMBSLsL. 10:2821-2832. 

For example, on day 0, precoat 2-chamber permanox slides with 
1 ml per chamber of fibronectin, 10 ng/ml in PBS , for 30 min at 
25 room temperature. Rinse once with PBS. Then plate COS cells at 
2-3 x 105 cells per chamber in 1.5 ml of growth media. Incubate 
overnight at 37° c. 

On day 1 for each sample, prepare 0.5 ml of a solution of 66 
Hg/ml DEAE-dextran, 66 UM chloroquine, and 4 ag dna in serum free 
30 DME. For each set, a positive control is prepared, e.g., of huIL- 
45 10 -FLAG cDNA at 1 and 1/200 dilution, and a negative mock. Rinse 

cells^with serum free DME. Add the DNA solution and incubate 5 hr 
at 37° C. Remove the medium and add 0.5 ml 10% DMSO in DME for 
2.5 min. Remove and wash once with DME. Add 1.5 ml growth medium 
50 35 and incubate overnight. 
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On day 2, change the medium. On days 3 or 4, the cells are 
fixed and stained. R inse the cells twice ^ Hank . s Buffered 
Saline Solution <HBSS» and fix in 4% paraformaldehyde 
(PFA) /glucose for 5 min. Wash 3X with HBSS. The slides may be 
stored at -80° C after all l igu i d is removed. For each chamber 
0.5 ml incubations are performed as follows. Add HBSS/saponin ' 
(0.1%) with 32 ul/ml of 1M NaN3 f or 20 min. Cells are then washed 
with HBSS/saponin lx. Soluble antibody is added to cells and 
incubate for 30 min. Wash cells twice with HBSS/saponin Add 
second antibody, e.g.. Vector anti-mouse antibody, at 1/200 
dilution, and incubate for 30 min. Prepare ELISA solution e g 
Vector Elite ABC horseradish peroxidase solution, and preincubate 
for 30 mm. Use, e.g., 1 drop of solution A (avidin) and 1 drop 
solution B (biotin) per 2 . 5 ml HBSS/saponin. Wash cells twice 
15 with HBSS/saponin. Add ABC HRP solution and incubate for 30 min 
Wash cells twice with HBSS, second wash for 2 min, which closes ' 
cells. Then add Vector diaminobenzoic acid (DAB) for 5 to 10 min 
Use 2 drops of buffer plus 4 drops DAB plus 2 drops of H 2 0 2 per 5 " 
ml of glass distilled water. Carefully remove chamber and rinse 
0 slide in water. Air dry for a few minutes, then add 1 drop of 
Crystal Mount and a cover slip. Bake for 5 min at 85-90° C 

Alternatively, the antibodies to a selected protein are used 
to affinity purify or sort out cells expressing the antigen See 
e.g., Sambrook et al . or Ausubel et al, which are incorporated 
5 herein by reference. The antigen is typically expressed on the 
cell surface. 

Hybridization approaches may also be utilized to find closely 
related variants of the antigen based upon nucleic acid 
hybridization . 

) 

XIV. Screening for DC-PGT Substrate Specificity 

The types of organic anions transported by DC-PGT of the 
present invention can be directly tested using standard methods 
For example, DC-PGT cDNAs can be expressed in HeLa cell monolayers 
or m Xenopus oocytes to determine the ability of DC-PGT to uptake 
various tracer labeled substrates e.g., prostaglandins such as 
PGE1, PGE2, PGE2a, PGD2 , thrombaxanes such as TxB2 or non- 
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prostaglandin anionic substrates such as glutathione, p-amino 
hxppurate, taurochoalate, urate, unconjugated and conjugated 
bxlxrubin, and estradiol glucouronide. For example, for oocyte 
expression, water or complementary RNA (cRNA) that has been 
transcribed in vitro fro, DC-PGT cDNA and capped is injected into 
xenopus oocytes at approximately 50 ng of crna per oocyte. Uptake 
studies are performed 2 to 3 days after injection by washirg of 
oocyte, three s, in Waymouth • s solution, incubating for various 
Periods at 27'C with radioactive substrates (approx. 0.25 uCi/ml- 
total concentration, approx. 1 nM> . washing three tin.es with ice- 
cold Waymouth's solution, and lysing in 0.5 ml of 10% SDS 
Oocyte-associated radioactivity is determined by liquid 
scintillation spectroscopy. For HeLa cell expression, cells are 
grown to approx. 80% confluence on 35 mm dishes then infected with 
recombinant vaccinia virus vTF7-3 of 10 plague forming units per 
cell according to a method of Fuesst, et al . (l 986 ) Proc . 
ACad - SC1 ' • USA 83 : "22-8126. Thirty minutes after infection cells 
are transfected with DC-PGT cDNA (10 ug/ml) plus lipofectin (20 
Ug/ml) according to a method of Blakely, et al. (1991) An al 
20 Si Qshmu 194:302-310. After 3 hours of incubation, vaccinia virus 
an the DMA- lipofectin complex are removed, and the cells are 
maxntaxned overnight ir. Dulbecco ■ s modified Eagle's medium 
supplemented with 5% fetal bovine serum. Uptake studies are 
performed 19 hours after transf ection. Monolayers are washed 
three txmes with culture medium without serum and incubated for 
various times at 27° c with radioactive substrate (0.5 nci/mi pe~ 
dish; total concentration, approx. 0.2 nM, . Uptake is stopped by 
washxng cells once with ice-cold Waymouth's solution containing 5% 
bovine serum albumin and then four times with Waymouth-s solution 
30 alone. Cells are scrapped and the associated radioactivity is 
measured by liquid scintillation spectroscopy. 

XV. Measuring DC-PGT Substrate Uptake Kinetics 

Competitive tracer uptake kinetics using DC-PGT comparing 
35 various prostaglandins or thromboxanes (e.g., PGE i, roE2< PGE2a 

PGD2 or TXB2) are determined using standard competitive transport 
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assays. For example for determining time dependent uptake of 
tracer labeled prostaglandin uptakes into HeLa cells expressing 
DC-PGT clones the following 3 [H] _ PGs final concentrations are ufi 
(New England Nuclear, Boston, MA): P G E 2 : 0.7 nM C176 cpm/fmol) , 
PGE 1: 0.6 nM (62 cpm/fmol); PGD 2 : 0.9 nM (126 cpm/fmol); PGF 2a : 
0.6 nM (185 cpm/fmol); TXB2 : 1.0 nM (114 cpm/fmol); PG I 2 analog 

3 [H] -iloprost (Amersham Corp., Arlington Heights, IL) at 7.9 nM 
(14 cpm/fmol) . 

XVI. Determining DC-PGT uptake inhibition 

Compositions inhibiting DC-PGT uptake can also be measured. 
For example to measure the inhibition of tracer PGE 2 , uptakes at 
10 min intervals (0.2 nM 3[h]-pge 2 ) with or withou- various 
concentrations of unlabeled prostanoids PGE 2 , PGEi, PGD 2 , PGF 2ot 
TJCB2, PGI 2 , (100-500 nM; Cayman Chemical, Ann Arbor MI) or 
inhibitors such as furosemide, probenecid, and indomethacin (10- 
100 UM. Sigma Chemical Co., St. Louis, MO) are determined in 
duplicate on a given transfection for one or two separate 
transfections. since the substrate concentrations are at least 
500 times less than the concentration of unlabeled prostanoids an 
apparent affinity constant, Ki /2 is determined from the equation: 
Kl/2 = fvi/(v-vi)] [i] where v = uptake without inhibitor, vi = 
uptake with inhibitor, and i - inhibitor concentration as 
described by Neame and Richards (1972) in Elementary Kinetic* nf 
Membrane Carrier Transp ort, John Wiley & Sons, New York. 

XVTI. Screening for Agonists or Antagonists 

Using a HeLa or Xenopus system, described above, or a 
comparable system, one of ordinary skill in the art can use the 
DC-PGT of the invention to screen for inhibitors or agonists of 
DC-PGT mediated tracer transport. The efficacy of potential 
antagonists can be compared with known PG transport inhibitors 
such as furosemide, probenecid, or indomethacin. Potential 
agonist or antagonist compositions are incubated, using a system 
as described above, for a time sufficient to allow binding of the 
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test composition and the DC-PGT transporter. Enhancement or 
decrement in measures of tracer uptake can be correlated to the 
specific composition being tested. Accordingly, one can identify 
compounds or compositions that modulate organic anion transport 
5 vxa the DC-PGT transporter of the invention by assessing the 

uptake of various anions such as prostaglandins or thrombaxanes in 
the presence and absence of the compound or compositions being 

15 t6Sted - Sin,ilar methods ™y ^ used to screen for substrates for 

the enzymes, e.g., Dubs and cyclin E2 . 

10 

XVIII. Isolation of Ligand for Receptor 

A construct for expression of the produce can be used as a 
specify binding reagent to identify its binding partner e g 
ligand, by taking advantage of its specificity of binding, much 
15 like an antibody would be used. A receptor reagent i s either 
25 labeled as described above, e.g.. fluorescence or otherwise or 

immobilized to a substrate for panning methods. See also 
Anderson, et al . (1997) ^Jaiyre 390:175-179, which is incorporated 
herein by reference. 

The binding composition is used to screen an expression 
library made from a cell line which expresses a binding partner 
e.g., TNF family l igand . Standard staining techniques are used to 
detect or sort intracellular or surface expressed receptor or 
surface expressing transformed cells are screened by panning 
Screening of intracellular expression is performed by various 
staining or immunofluorescence procedures. See also McMahan et 
al. (1991) EMBO J. 10:2821-2832. 

Alternatively, receptor reagents are used to affinity purify 
or sort out cells expressing a receptor. See, e.g.. Sambrook, et 
30 al. or Ausubel, et al. 

Another strategy is to screen for a membrane bound ligand by 
45 panning. The cDNA containing ligand cDNA is constructed as 

described above. The ligand can be immobilized and used to 
immobilize expressing cells. Immobilization may be achieved by 
use of appropriate antibodies which recognize, e.g., a FLAG 
sequence or a receptor fusion construct, or by use of antibodies 
raised against the first antibodies. Recursive cycles of 
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selection and amplification lead to enrichment of appropriate 
clones and eventual isolation of ligand expressing clones 

Phage expression libraries can be screened by receptor 
Appropriate label techniques, e.g., anti-FLAG antibodies, will 
allow specific labeling of appropriate clones. 
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IX. Chemo taxis Assays 

Cheraokine proteins are produced, e.g., in COS cells 
transfected with a plasmid carrying the chemokine cDNA by 
electroporation. See, Kara, et al . (1992) EMBO ,7. 10:1875-1884 
Physical analytical methods may be applied, e.g., CD analysis to 
compare tertiary structure to other chemokines to evaluate whether 
the protein has likely folded into an active conformation. After 
transfection, a culture supernatant is collected and subjected to 
baoassays. A mock control, e.g., a plasmid carrying the 
25 luciferase cDNA, is used. See, de Wet, et al. (l 987 ) Mol . rp1 , 

BioL, 7:725-757. a positive control, e.g., recombinant murine 
MIP-lcc from R & D Systems (Minneapolis, MN) , i s typically used. 
Likewise, antibodies may be used to block the biological 
20 activities, e.g., as a control. 

Lymphocyte migration assays are performed as previously 
described, e.g., in Bacon, et al . {1988) Br , , T , g5:96f ._ 
35 9 7 4 • 0th « trafficking assays are also available. See eg 

Quidling-jarbrink, et al. (1995, Eur. J. Tm , mn1 25:322-327-' 
25 Koch, et al. (1994) J. Clinic t^,^^ 93:921 _ 928; ^ 
Antony, et al. (1993) ii _rmmunoU 151:7216-7223. Murine Th2 T 
CSl1 Cl ° neS ' CDC " 25 (see Ton ^ et al. (1985) J. Exn ^ i 61 . 223 _ 
241) and HDK-1 (see Cherwinski, et al . (1987) J. Exp. ^ 
166:1229-1244), made available from R. Coffman and A. O'Garra 
30 (DHAX, Palo Alto, CA), respectively, are used as controls 
45 Ca2+ fluX Upon ch emokine stimulation is measured according to 

the published procedure described in Bacon, et al. (1995) ^ 
Immunol 154:3654-3666. 



Maximal numbers of migrating cells in response to MlP-la 
35 typically occur at a concentration of 10~8 M, i„ agreement with 

SUBSTITUTE SHEET ( rule 26 ) 



10 



WO 00/0181 7 „ 

PCT/US99/I2366 

110 

original reports for CD4 + populations of human T cells. See 

Schall (1993) J_ 1 __Exo. Ned 177-1891 ime * ^ 

'-eg, i//.ia21-182 6. A dose-response curve 

is determined, preferably giving a characteristic bell shaped 
dose-response curve. 

5 After stimulation with CC chemokines, lymphocytes generally 

show a measurable intracellular Ca2 + flux. MIP-la is capable of 
IS inducing immediate transients of calcium mobilization. Typically 

the levels of chemokine used in these assays will be comparable to 
those used for the chemotaxis assays (1/1000 dilution of 
10 conditioned supernatants ) . 
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XX. Biological Activities 

A robust and sensitive assay is selected as described above 
e.g., on immune cells, neuronal cells, or stem cells. Chemokine' 
is added to the assay in increasing doses to see if a dose 
response is detected. For example, in a proliferation assay 
cells are plated out in plates. Appropriate culture medium is 
provided, and chemokine is added to the cells in varying amounts 
Growth is monitored over a period of time which will detect either 
20 a direct effect on the cells. or an indirect effect of the 
chemokine . 

Alternatively, an activation assay or attraction assay is 
used. An appropriate cell type is selected, e.g., hematopoietic 
cells, myeloid (macrophages, neutrophils, polymorphonuclear cells 
etc.) or lymphoid (T cell, B cell, or NK cells), neural cells 
(neurons, neuroglia, oligodendrocytes, astrocytes, etc.), or stem 
cells, e.g., progenitor cells which differentiate to other cell 
40 tyP6S ' e - g -' cr YP fc c «Hs and undifferentiated cell types 

Retroviral infection assays have also been described using 
30 e.g., the CCR1. CCR3, and CCR5 receptors. These receptors, which 
bxnd the RANTES and MIP-1 related chemokines, are likely also to 
45 be receptors for the HCC5 . Recent description of these chemokine 

receptors in retroviral infection processes, and the effects by 
the related RANTES and MIP-1 chemokines, suggest similar e ff ects 
35 may exist with the HCC5 . See, e.g., Baiter (1996) Soignee 
50 272:1740 (describing evidence for chemokine receptors as 

coreceptors for HIV) ; and Deng, et al . (1996) 1Ja ture 381:661-666. 
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Chemokines may also be assayed for activity in hemopoietic 
assays as described, e.g., by H. Broxmeyer. See Bellido, et al 
U995) Jl Clinioal Investigation 95:2886-2895; and Jilka, et al 
C1995) Bxpfl Hemat-mnqy 23:500-506. They may be assayed for 
angiogenic activities as described, e.g., by Streiter, et al 
(1992) Am , , j, p a ,- ho i 141:1279-1284. Or for a role in 
inflammation. See. e.g., Wakefield, et al. (1996) j. sura< ra1 
Res. 64:26-31. 

Other assays will include those which have been demonstrated 
wath other chemokines. See, e.g., Schall and Bacon (1994) Current 
Opinion in Tp I P n]n ^ 6:865-873; and Bacon and Schall (1996) Inj^ 
Arch. Allergy & Immunol 109:97-109. 

The DUB genes will be screened for the deubiguitinating 
activities, as described. See, e.g., Hochstrasser (1995) C urr 
15 Opin, CPU Mnl , 7:215-223; Wilkinson, et al . (1995) Biocherni^ry 
34:14535-14546; Baker, et al. ,19 92) J. Binl. rho,n 267:23364- 
23375; Baek et al. (1998) gjo!. r P Pm , 272:25560-25565; and Papa 
and Hochstrasser (1993) Nature. 366:313-319. For example, for an 
in vitro assay for UBP Activity, ^5 x _ laheled ub _ PESTc is used qs 
a .substrate according to the method of Woo, et al. (1995) £^io±_ 
Chern, 270:18766-18773. Reaction mixtures (0 . 1 ml) contain the 
35 proper amount of the enzyme preparations and 10-30 fig of 125 r _ 

labeled Ub-PESTc in 100 mM Tris-HCl (pH 7.8), 1 mM dithiothreitol 
1 mM EDTA, and 5% glycerol. After incubating the mixtures for 
appropriate periods, the reaction is terminated by adding 50 ul of 
40% (w/v) trichloroacetic acid and 50 M l of 1.2* (w/v) bovine 
serum albumin. The samples are centrifuged, and the resulting 
supernatants are counted for their radioactivities using a 
counter. The enzyme activity is expressed as a percentage of 
125!_ labeled u^pes^ hydrolyzed tQ acid _ soluble products , men 
assaying the hydrolysis of Ub-NH-carboxyl extension proteins and 
His-di-Ub, incubations are performed as above but in the presence 
of 5 fig of the substrate. After incubation for appropriate 
periods, the samples are subjected to discontinuous gel 
electrophoresis as described by Baek, et al. (19 98 ) j. Biol, rh^ 
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272:25560-25565. Proteins in the gels were then visualized by 
stainmg with Coomassie Blue R-250 or by exposing to x-ray films 
(Fuji, at 70° C. to prepare 12 5l _ labeled poly-ub-NH-iyso^ 
conjugates. 2 ug of the 125 x . lah&led lysozyme (5 x 1q5 ^ ^ 
5 mcubated with 10 Hg of Ub, 120 ng of fraction II, and an ATP- 
regenerating system consisting of 10 mM Tris-HCl ( p h 7 8) 
15 units /ml creatine phosphokinase, 6 . 5 mM phosphocreatine, 1 5 mM 
ATP, 1 mM dithiothreitol, 0.5 mM MgCl 2 , and 1 mM KC1 in a final 
volume of 0.05 ml. Incubations are performed for 2 h at 37° C in 
10 the presence of 1 mM hemin to prevent proteolysis of the 

ubiquitinated protein conjugates by the 26 S proteosome. After 
incubation, the samples are heated for 10 min at 55° C for 
inactivation of endogenous UBPs . Alternative^, DubH or Dubl2 
can be expressed as a GST fusion protein according to the method 
15 of 2h u, et al. ,1997) J. Biol, ch.m 272=51-57 by cloning into an 
appropriate expression vector and subsequently co- transformed with 
a plasmid encoding ub-Met-P-gal. in which ubiquitin is fused to 
the NH 2 terminus of p-galactosidase and testing for cleavage. 

However, the deubiquitinating enzymes have also been reported 
to have additional functions besides deubiquitination. See e g 
Hochstrasser (1996) QgLL 84:813-815; Hicke and Riezman (1996) Cell 
84:277-287; and Chen, et al . (1996) fiaH 84:853-862. 

The MD gene products will be screened for cell signaling 
activities. See, e.g., Miyake, et al . (1998) J. Immunol. 
161:1348-1353; Kobe and Deisenhofer (1994) Trends Ri^hp. c„,- 
19:412. 



XXI. Antagonizing cyclin E2 proteins 

The inhibition of cell cycle progression is especially 
important for the control of abnormally proliferative diseases, 
e.g., cancer. Several methods are available to accomplish this 
control. The ability of cyclin binding is inhibited by the use, 
e.g., of antibodies raised against the cyclin binding proteins. 
Other elements include, e.g., peptidomimetics which are peptides 
designed to mimic the binding site of cyclin associated proteins 
and disrupt the interaction of these proteins with cyclin. The 
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most effective method to block cell cycle progression is the use 
of small molecules, e.g., to block the interaction of the 
associated proteins with cyclin, or to block downstream activity 
of the associated proteins, as described, e.g., in Hung, et al 
(1996) Chemistry ang Biology 3:623-639. Exposure of a cell to 
these permeable small molecules should cause a conditional loss of 
function of the target protein. 

Also included in this category is the use of gene therapy to 
block the expression of the cyclin associated protein or gene 
transcription factors. Methods of using gene therapy are 
described, e.g., in Goodnow (1992) "Transgenic Animals" in Roitt 
(ed,) Encyclopedia 0 f Immunol ogy , Academic Press, San Diego, pp. 
1502-1504; Travis (1992) Science 256:1392-1394; Kuhn, et al '. 
(1991) Science 254:707-710; Capecchi (1989) Science 244:1288; 
Robertson (1987) (ed.) Tera tocarci nomas ,nrf T^H^ ic * tpm ^ 
Practice] Ap nro^r h , irl Press, Oxford; and Rosenberg (1992) ^ 
QUnical Oncol oqy , 10:180-199. Also included is the use of 
antisense RNA in gene therapy to block expression of the target 
gene, or proper splicing of gene transcripts. 

All citations herein are incorporated herein by reference to 
the same extent as if each individual publication or patent 
application was specifically and individually indicated to be 
incorporated by reference. 

Many modifications and variations of this invention can be 
made without departing from its spirit and scope, as will be 
apparent to those skilled in the art. The specific embodiments 
described herein are offered by way of example only, and the 
invention is to be limited by the terms of the appended claims 
along with the full scope of equivalents to which such claims are 
entitled. 
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WHAT IS CLAIMED IS: 

1. An isolated or recombinant antigenic polypeptide 
10 comprising: 

5 a) a plurality of distinct segments, wherein each said 

segment has identity to at least 12 contiguous amino 
acids from the mature SEQ id NO: 2; or 
b) at least 17 contiguous amino acids from the mature SEQ ID 
NO: 2. 

10 

2. The polypeptide of Claim 1. wherein said plurality of 
segments includes 

a) one of at least 19 contiguous amino acids; or 
b) 



15 
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two of at least 15 contiguous amino acids. 



3- The polypeptide of Claim 1. wherein said polypeptide: 

a) comprises the mature SEQ ID NO: 2; 

b) binds with specificity to a polyclonal antibody which 
specifically binds to SEQ ID NO: 2; or 

20 c) said polypeptide: 

i) is a natural allelic variant of SEQ ID NO: 2; 
xi) is at least 30 amino acids in length; 
iii) exhibits at least two non-overlapping epitopes 
specific for SEQ ID NO: 2; 
25 iv ' is a synthetic polypeptide; 

v) is attached to a solid substrate; or 

vi) is a 5-fold or less conservative substitution from 
SEQ ID NO: 2. 

30 4 - A fusi °n protein comprising first and second portions, 

said first portion comprising a polypeptide of Claim 1 and said 
second portion comprising a detectable marker. 

5. A pharmaceutical composition comprising a sterile 

35 polypeptide of Claim 1 in a pharmaceutical ly acceptable carrier. 

6. An isolated or recombinant polynucleotide encoding a 
50 polypeptide of Claim 1. 



SUBSTITUTE SHEET ( rule 26 ) 



55 



WO 00/01817 



115 



PCT/US99/I2366 



7. The polynucleotide of claim 6, which: 

a) comprises the mature polypeptide coding portion of SEQ ID 

NO: 1; or 

b) encodes the mature SEQ ID NO: 2. 

8- The polynucleotide of Claim 6, wherein said 
polynucleotide is: 

a) a PCR product; 

b) a hybridization probe ; 

c) a mutagenesis primer,- or 

d) made by chemical synthesis. 

9. The polynucleotide of Claim 6, which is: 

a) detectably labeled; 

b) a deoxyribonucleic acid; or 

c) double stranded. 

10. An expression vector comprising a polynucleotide of 
Claim 6 . 

11- The vector of Claim 10, wherein said polypeptide 
specifically binds polyclonal antibodies generated against an 
linmunogen of mature SEQ ID NO: 2. 

12. The vector of Claim 10, which 

a) selectively hybridizes under stringent hybridization 

conditions to a target polynucleotide sequence having at 
least 60 contiguous nucleotides from SEQ ID NO: 1- 

b) encodes a polypeptide having at least 50 contiguous amino 

acid residues from mature SEQ id NO: 2; or 

c) is suitable for transfection into a prokaryote or 

eukaryote host cell. 

13. The vector of Claim 12, wherein said host cell i s , 

a) a mammalian cell; 

b) a bacterial cell- 
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c) an insect cell; 

d) a prokaryote; 

e) a eukaryote; or 

f) a COS cell. 



10 



14. A method of making a polypeptide comprising expressing 
said vector of Claim 13 in said host cell. expressing 

15. An isolated or recombinant polynucleotide which 
hybridizes to the coding portion of SEQ ID NO: 1 under stringent 
hybridation and wash conditions of at least 50 • c a salt 
concentration of less than 400 mM, and 50% f oroide. 

16^ An expression vector comprising the polynucleotide of 



15 Claim 15. 

17. 



The vector of Claim 16 which hybridizes to the coding 
Portion of SEQ XD N0: 1 un der stringent hybridi2ation ^ ff 
conditions of at least 60' c, a salt concentration of less thl 
30 20 200 mM. and 50% rormamide. ^ 

18. The vector of Claim 25, which encodes a polypeptide 

ZVZT* lly binds - ~ ™ — * — 

25 

rn Mo"; T P ° lynUCle0tlde of Claim 15 which hybridizes to SEQ 
ID NO: 1, wherein said polynucleotide is: 

a) a PCR product; 

b) a hybridization probe; 
30 c) a mutagenesis primer; or 

d) made by chemical synthesis. 



20. A method of modulating the physiology or development of 
a cell, comprising contacting said cell with an agonist or 
35 antagonist of a polypeptide of Claim 1. 
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21. A method of detecting the presence of a complementary 
polynucleotide in a sample, comprising contacting a polynucleotide 
of Claim 6 that selectively hybridizes with said complementary 
polynucleotide in said sample to form a detectable duplex; thereby 

> Seating the presence of said polynucleotide in said sample. 

22. A method for identifying a compound that binds to a 
polypeptide of Claim 1, comprising: 

a) incubating components comprising said compound and said 

polypeptide under conditions sufficient to allow the 
components to interact; and 

b) measuring the binding of the compound to said 

polypeptide. 

23. An isolated or recombinant polynucleotide encoding an 
antigenic polypeptide comprising: 

a) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 6 ; 

b) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 8; 

c) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 10; 

d) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 12; 

e) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 17; 

f ) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 19; 

g) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 21; or 

h) at least 17 contiguous amino acids from the mature 

polypeptide from SEQ ID NO: 23. 

24. The polynucleotide of Claim 23, encoding all of the 
polypeptide of: 

a) signal processed SEQ ID NO: 6; 

b) signal processed SEQ ID NO: 8; 

c) signal processed SEQ ID NO: 10; 
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d) signal processed SEQ ID NO: 12; 

e) signal processed SEQ ID NO: 17; 

f) SEQ ID NO: 19; 

g) SEQ ID NO: 21; or 
5 h) SEQ ID NO: 23. 

25. The polynucleotide of Claim 23, which hybridizes at 55 
C, less than 500 mM salt, and 50% formamide to the: 

a) mature protein coding portion of SEQ ID NO: 5; 
10 b) signal processed coding portion of SEQ ID NO: 7; 

c) signal processed coding portion of SEQ ID NO: 9; 

d) signal processed coding portion of SEQ ID NO: 11 ; 

e) mature protein coding portion of SEQ id NO: 15; 

f) polypeptide coding portion of SEQ ID NO: 18; 

15 9) polypeptide coding portion of SEQ ID NO: 20; or 

h) polypeptide coding portion of SEQ ID NO: 22. 



20 



26. The polynucleotide of Claim 25. comprising at least 35 
contiguous nucleotides of: 

a) mature protein coding portion of SEQ ID NO: 5 ; 

b) signal processed coding portion of SEQ ID NO: 7 ; 

c) signal processed coding portion of SEQ ID NO: 9- 

d) signal processed coding portion of SEQ ID NO: ll ; 

e) mature protein coding portion of SEQ ID NO: 16; 

f) polypeptide coding portion of SEQ ID NO: 18; 

9) polypeptide coding portion of SEQ ID NO: 20; or 
h) polypeptide coding portion of SEQ id NO: 22. 

27. An expression vector comprising the polynucleotide of 
Claim 23 . 

28. A host cell containing the expression vector of Claim 
27, including a eukaryotic cell. 

29. A method of making an antigenic polypeptide comprising 
expressing a recombinant polynucleotide of Claim 23. 
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30. A method for detecting a polynucleotide of Claim 23 
comprising contacting said polynucleotide with a probe that ' 
hybr ldlzeS , under stringent conditions, to at least 25 contiguous 
nucleotides of the: u m 

a) mature protein coding portion of SEQ XD NO: 5; 

b) signal processed coding portion of SEQ ZD NO-V 
O signal processed coding portion of SEQ ID NO: 9- 

d) signal processed coding portion of SEQ ID NO: 11 ■ 

e) mature protein coding portion of SEQ ID NO: 16; 

f) polypeptide coding portion of SEQ ID NO: 18; 

g) polypeptide coding portion of SEQ ID NO: 20,' or 

h) polypeptide coding portion of SEQ ID NO- 22- 

to form a duplex, wherein detection of said duplex indicates the 
presence of said polynucleotide. 

31. A kit for the detection of a polynucleotide of Claim 23 
comprising a compartment containing a probe that hybridizes, under 
stringent hybridization conditions, to at least 17 contiguous 
nucleotides of a polynucleotide of Claim bl to form a duplex 



labeled . 
33. 



32. The kit of claim 31. wherein said probe is detectably 



A binding compound comprising an antibody binding site 
which specifically binds to a polypeptide comprising at least 17 
contiguous amino acids from: 



a) 


signal processed SEQ 


ID 


NO 


: 6; 


b) 


signal processed SEQ 


ID 


NO 


: 8; 


c) 


signal processed SEQ 


ID 


NO 


: 10; 


d) 


signal processed SEQ 


ID 


NO 


12; 


e) 


signal processed SEQ 


ID 


NO: 


17; 


f) 


SEQ ID NO: 19; 








g) 


SEQ ID NO: 21; or 








h) 


SEQ ID NO: 23. 








34. 


The binding compound 


Of 


Claim 33, 


a) 


said antibody binding 


site 


is : 
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1) selectively immunoreactive with the: 



a) 


signal 


processed 


SEQ 


ID 


NO: 


6; 


b) 


signal 


processed 


SEQ 


ID 


NO: 


8; 


c) 


signal 


processed 


SEQ 


ID 


NO: 


10 


d) 


signal 


processed 


SEQ 


ID 


NO: 


12 


e) 


signal 


processed 


SEQ 


ID 


NO: 


17 


f) 


SEQ ID 


NO: 19; 










g) 


SEQ ID 


NO: 21; or 










h) 


SEQ ID 


NO: 23; 











15 b) 



2> ralSed against a Purified or recombinant^ produced 
human HDTEA84 protein ; 

3) raised against a purified or recombinant^ produced 
human HSLJD37R protein; or 

4) in a monoclonal antibody, Fab, or F{ab)2; or 
said binding compound is: 

25 1) an antibody molecule; 

2) a polyclonal antiserum; 

3) detectably labeled; 

4) sterile; or 

30 20 5) in a buffered composition. 

35. A method using the binding compound of Claim 33 
comprising contacting said binding compound with a biological 

35 sample comprising an antigen, thereby forming a binding 

25 compound: antigen complex. 

36. The method of Claim 35, wherein said biological sample 
is from a human, and wherein said binding compound is an antibody 

„„ n " • A detectloI > kit comprising said binding compound of 
30 Claim 34, and: 

a) instructional material for the use of said binding 
compound for said detection; or 

b) a compartment providing segregation of said binding 



compound. 



35 



50 
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which ^ / ; UbStantially ^ e « i-olated antigenic polypeptide, 
which binds to said binding composition of Claim 33, and further 
comprises at least 17 contiguous amino acids from: 
a) signal processed SEQ ID NO: 6; 
signal processed SEQ ID NO: 8; 
signal processed SEQ ID NO: 10; 
signal processed SEQ ID NO: 12 
signal processed SEQ ID NO: 17 
SEQ ID NO: 19; 
SEQ ID NO: 21; or 
SEQ ID NO: 23. 



b) 
c) 
d> 
e) 
f) 

S) 
h) 



39. The polypeptide of Claim 38, which: 

a) comprises at least a fragment of at least 25 contiguous 

amino acid residues from a primate HDTEA84 protein - 

b) comprises at least a fragment of at least 25 contiguous 

amino acid residues from a primate HSLJD37R protein - 
O comprises at least a fragment of at least 25 contiguous 
ammo acid residues from a rodent or primate RANKL 
protein; 

d) is a soluble polypeptide; 

e) is detectably labeled; 

f ) is in a sterile composition; 

g) is in a buffered composition; 

h) binds to an sialic acid residue ; 

i) is recombinantly produced, or 

3) has a naturally occurring polypeptide sequence 
40. The polypeptide of Claim 39, which comprises at least 17 
contiguous amino acids from the: 

a) signal processed SEQ ID NO: 6; 

b) signal processed SEQ ID NO: 8; 

c) signal processed SEQ ID NO: 10; 

d) signal processed SEQ ID NO: 12; 

e) signal processed SEQ ID NO: 17; 

f) SEQ ID NO: 19; 

g) SEQ ID NO: 21; or 

h) SEQ ID NO: 23. 
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41. a method of modulating a precursor cell physiology or 
function comprising a step of contacting said cell with- 

a) a biding compound which binds to said polypeptide of 

Claim 38; 

b) an HDTEA84 polypeptide ; 

c) an HSLJD37R polypeptide; or 
<3) a RANKL polypeptide. 

42 The method of Claim <u. wherein said contacti 
combination with a TNF famiJv n^-^ 

f ., ... Nt t£Unlly i^SMd- °r an antagonist of said TNF 

ramily ligana. 

43. A composition of matter selected from: 

a) a substantially pure or recombinant HCC5 polypeptide 

exhibiting identity over a length of at least 12 amino 
acids to SEQ ID NO: 25; 
an^isolated natural sequence HCC5 of mature SEQ id NO: 

a fusion protein comprising HCC5 sequence; 
a substantially pure or recombinant Dubll 'polypeptide 
exhibiting identity over a length of at least about 12 
ammo acids to SEQ ID NO: 32 or 34; 
an isolated natural sequence Dubll of mature SEQ id NO: 
32 or 34; 

a fusion protein comprising Dubll sequence; 
a substantially p Ure or recombinant Dubl2 polypeptide 
exhibiting identity over a length of at least about 12 
ammo acids to SEQ ID NO: 36 or 38; 
an isolated natural sequence Dubl2 of mature SEQ ID NO- 
36 or 38; 

a fusion protein comprising Dubl2 sequence; 
a substantially pure or recombinant MD-1 polypeptide 
exhibiting identity over a length of at least about 12 
amino acids to SEQ ID NO: 42; 

an isolated natural sequence MD-1 of mature SEQ ID NO- 
42; 



b) 

c) 
d> 

e) 
f) 



h) 

i) 
3) 
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1) a fusion protein comprising primate MD-I sequence- 
m) a substantially pure or recombinant MD-2 polypeptide 

exhibiting identity over a length of at least about 12 

amino acids to SEQ ID NO: 44 or 46; 

n) an isolated natural sequence MD-2 of mature SEQ ID NO- 44 
or 46; 

o) a fusion protein comprising primate MD-2 sequence- 
p) a substantially pure or recombinant MD-2 polypeptide 

exhibiting identity over a length of at least about 12 

ammo acids to SEQ ID NO: 48 or 49; 
q) an isolated natural sequence MD-2 of mature SEQ ID N0- 

48; or 

r> a fusion protein comprising murine MD-2 sequence. 

44. The composition of claim 43, which is a substantially 
pure or isolated: 

a) a HCC5 polypeptide, wherein said length is at least 17 

amino acids; 

b) a DubH polypeptide, wherein said length is at least 17 

amino acids,- 

O a Dubl2 polypeptide, wherein said length is at least 17 
amino acids; 

d) a primate MD-1 polypeptide, wherein said length is at 

least 17 amino acids; 

e) a primate MD-2 polypeptide, wherein said length is at 

least 17 amino acids; or 

f) a rodent MD-2 polypeptide, wherein said length is at 

least 17 amino acids . 

45. The composition of Claim 44, which is a substantially 
pure or isolated: 

a) a HCC5 polypeptide, wherein said length is at least 21 

amino acids; 

b) a DubH polypeptide, wherein said length is at least 21 

amino acids; 

O a Dubl2 polypeptide, wherein said length is at least 21 
amino acids ; 
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d) a primate MD-1 polypeptide, wherein said length is at 
least 21 amino acids; 
a primate md-2 polypeptide, wherein said length is at 

least 21 amino acids; and 
a rodent md-2 polypeptide, wherein said length is at 
least 21 amino acids. 



e) 
f) 



46. The composition of matter of claim 43, wherein said- 
a) HCC5 polypeptide: 

i) is from a primate, including a human; 

ii) comprises at least one polypeptide segment of SEQ 
ID NO: 25; 

iii) exhibits a plurality of portions exhibiting said 
identity; 

iv) is a natural allelic variant of HCC5 ; 

v) has a length at least about 30 amino acids- 

vx) exhibits at least two non-overlapping epitopes 
which are specific for a primate HCCS ,- 

vii) exhibits a sequence identity over a length of at 
least 35 amino acids to a HCC5; 

viii) is glycosylated; 

ix) is a synthetic polypeptide; 

x) is attached to a solid substrate; 

xi) is conjugated to another chemical moiety; 

xii) is a 5-fold or less substitution from natural 
sequence; or 

xiii) is a deletion or insertion variant from a natural 
sequence ; 

b) Dubll polypeptide: 

i) is from a primate, including a human; 

ii) comprises at least one polypeptide segment of SEQ 
ID NO: 32 or 34; 

iii) exhibits a plurality of portions exhibiting said 
identity ,- 

iv) is a natural allelic variant of Dubll; 

v) has a length at least about 3 0 amino acids ; 
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vi) exhibits at least two non-overlapping epitopes 
which are specific for a primate Dubll; 

vii) exhibits a sequence identity over a length of at 
least about 35 amino acids to a Dubll; 

viii) is glycosylated; 

ix) is a synthetic polypeptide ; 

x) is attached to a solid substrate; 

xi) is conjugated to another chemical moiety ; 

xii) is a 5-fold or less substitution from natural 
sequence ; or 

xiii) is a deletion or insertion variant from a natural 
sequence; 

c) Dubl2 polypeptide: 

i) is from a primate, including a human; 

ii) comprises at least one polypeptide segment of SEO 
ID NO: 36 or 38; 

iii) exhibits a plurality of portions exhibiting said 
identity; 

iv) is a natural allelic variant of Dubl2; 

v) has a length at least about 30 amino acids - 

vi) exhibits at least two non-overlapping epitopes 
which are specific for a primate Dubl2; 

vii) exhibits a sequence identity over a length of at 
least about 35 amino acids to a Dubl2; 

viii) is glycosylated; 

ix) is a synthetic polypeptide; 

x) is attached to a solid substrate; 

xi) is conjugated to another chemical moiety; 

xii) is a 5-fold or less substitution from natural 
sequence; or 

xiii) is a deletion or insertion variant from a natural 
sequence ; 

d) primate MD-1 polypeptide: 

i) is from a human ; 

ii) comprises at least one polypeptide segment of SEQ 
ID MO: 42; 
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iii) exhibits a plurality of portions exhibiting said 
identity; 

iv) is a natural allelic variant of primate MD-1; 

v) has a length at least about 30 amino acids; 

vij exhibits at least two non-overlapping epitopes 
which are specific for a primate MD-1; 

vii) exhibits a sequence identity over a length of at 
least about 35 amino acids to a primate MD-1; 

viii) is glycosylated; 

ix) is a synthetic polypeptide; 

x) is attached to a solid substrate; 

xi) is conjugated to another chemical moiety ; 
xxi) is a 5-fold or less substitution from natural 

sequence; or 

xiii) is a deletion or insertion variant from a natural 
sequence; 
e) primate MD-2 polypeptide:: 

i) is from a human; 

ii) comprises at least one polypeptide segment of SEQ 
ID NO: 44 or 46; 

iii) exhibits a plurality of portions exhibiting said 
identity; 

iv) is a natural allelic variant of primate MD-2; 

v) has a length at least about 30 amino acids; 

vi) exhibits at least two non-overlapping epitopes 
which are specific for a primate MD-2; 

vii) exhibits a sequence identity over a length of at 
least about 35 amino acids to a primate MD-2; 

viii) is glycosylated; 

ix) is a synthetic polypeptide; 

x) is attached to a solid substrate,- 

xi) is conjugated to another chemical moiety ; 

xii) is a 5-fold or less substitution from natural 
sequence; or 

xiii) is a deletion or insertion variant from a natural 
sequence,- or 

f) rodent MD-2 polypeptide: 
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i) is from a mouse; 

ii) comprises at least one polypeptide segment of SEQ 
ID NO: 48 or 49; 

iii) exhibits a plurality of portions exhibiting said 
identity; 

iv) is a natural allelic variant of rodent MD-2; 

v) has a length at least about 3 0 amino acids; 

vi) exhibits at least two non-overlapping epitopes 
which are specific for a rodent MD-2; 

vii) exhibits a sequence identity over a length of at 
least about 35 amino acids to a rodent MD-2; 

viii) is glycosylated; 

ix) is a synthetic polypeptide ,- 

x) is attached to a solid substrate; 

xi) is conjugated to another chemical moiety; 

xii) is a 5-fold or less substitution from natural 
sequence; or 

xiii) is a deletion or insertion variant from a natural 
sequence. 



47 



^ A composition comprising a sterile polypeptide of claim 
wherein said polypeptide is: 

a) HCC5 polypeptide; 

b) Dubll polypeptide; 

c) Dubl2 polypeptide; 

d) MD-1 polypeptide; or 

e) MD-2 polypeptide. 

48. A composition of Claim 43 comprising: 
a) said HCCS polypeptide and: 

1) a carrier, wherein said carrier is.- 

a) an aqueous compound, including water, 

saline, and/or buffer; and/or 

b) formulated for oral, rectal, nasal, 

topical, or parenteral administration,- 
2} another chemokine, including one selected from the 
group of HCC1, HCC2, HCC3, and HCC4 ; or 
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3) an antibody antagonist for a chemokine, including 
one selected from the group of HCC1, HCC2 , HCC3 
and HCC4; 

b) said Dubll polypeptide and a carrier, wherein said 

carrier is 

a) an aqueous compound, including water, saline, 

and/or buffer; and/or 

b) formulated for oral, rectal, nasal, topical, or 

parenteral administration,- 

c) said Dubl2 polypeptide and a carrier,' wherein said 

carrier is: 

a) an aqueous compound, including water, saline, 

and /or buffer; and /or 

b] formulated for oral, rectal, nasal, topical, or 

parenteral administration; 

d) said MD-1 polypeptide and a carrier, wherein said carrier 
is : 



a) an aqueous compound, including water, saline. 

and/or buffer; and/or 

b) formulated for oral, rectal, nasal, topical, or 

parenteral administration; 
e) said MD-2 polypeptide and a carrier, wherein said carrier 
is : 

a) an aqueous compound, including water, saline, 

and/or buffer; and/or 

b) formulated for oral, rectal, nasal, topical, or 

parenteral administration. 

49. The fusion protein of Claim 43 comprising: 

a) mature protein sequence of Table 7; 

b) mature protein sequence of Table 9 ; 

b) mature protein sequence of Table 11; 

c) a detection or purification tag, including a FLAG, His6, 

or Ig sequence; or 

d) sequence of another chemokine protein with said protein 

in a) . 
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50. A kit comprising a polypeptide of Claim 43, and: 

a) a compartment comprising said polypeptide; and/or 

b) instructions for use or disposal of reagents in said kit. 

51. A binding compound comprising an antigen binding portion 
from an antibody, which specifically binds to a natural: 

a) HCC5 polypeptide of Claim 43, wherein said antibody: 

i) is raised against a peptide sequence of a mature 

polypeptide sequence of Table 7; 

ii) is raised against a mature HCC5; 

iii) is raised to a purified HCC5; 

iv) is immunoselected; 

v) is a polyclonal antibody; 

vi) binds to a denatured HCC5; or 

vii) exhibits a Kd to antigen of at least 30 JIM; 

b) Dubll polypeptide of claim 43, wherein said antibody: 
i) is raised against a peptide sequence of a mature 

polypeptide sequence of Table 9 ; 
li) is raised against a mature Dubll; 

iii) is raised to a purified Dubll; 

iv) is immunoselected; 

v) is a polyclonal antibody; 

vi) binds to a denatured Dubll; or 

vii) exhibits a Kd to antigen of at least 30 UM; 

c) Dubl2 polypeptide of Claim 43, wherein said antibody: 
i) is raised against a peptide sequence of a mature 

polypeptide sequence of Table 9; 
-i) is raised against a mature Dubl2; 

iii) is raised to a purified Dubl2; 

iv) is immunoselected; 

v) is a polyclonal antibody; 

vi) binds to a denatured Dubl2; or 

vii) exhibits a Kd to antigen of at least 30 |jM; 

d) a primate MD-1 polypeptide of Claim 43, wherein said 
antibody: 
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i) is raised against a peptide sequence of a mature 

polypeptide sequence of Table 11; 
n) is raised against a mature MD-1; 

iii) is raised to a purified MD-1; 

iv) is immunoselected; 

v) is a polyclonal antibody; 

vi) binds to a denatured MD-1; or 

vii) exhibits a Kd to antigen of at least 30 JW; 

ej a primate MD-2 polypeptide of Claim 43, wherein said 
antibody: 

i) is raised against a peptide sequence of a mature 

polypeptide sequence of Table 11; 
ia) is raised against a mature MD-2; 

iii) is raised to a purified MD-2; 

iv) is immunoselected; 

v) is a polyclonal antibody; 

vi) binds to a denatured MD-2; or 

vii) exhibits a Kd to antigen of at least 30 UM; or 

f) a rodent MD-2 polypeptide of Claim 43, wherein said 
antibody: 

i) is raised against a peptide sequence of a mature 

polypeptide sequence of Table 11; 

ii) is raised against a mature rodent MD-2; 
ni) is raised to a purified rodent MD-2; 

iv) is immunoselected; 

v) is a polyclonal antibody; 

vi) binds to a denatured rodent MD-2; or 

vii) exhibits a Kd to antigen of at least 30 UM. 

52. The binding composition of Claim 51, wherein: 

a) said polypeptide is from a primate or rodent ,- 

b) said binding compound is an Fv, Fab, or Fab2 fragment; 

c) said binding compound is conjugated to another chemical 

moiety; 

d) is attached to a solid substrate, including a bead or 

plastic membrane; 
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e) is in a sterile composition; or 

f) is detectably labeled, including a radioactive or 

fluorescent label. 

53. A kit comprising said binding compound of Claim 51, and 
a) a compartment comprising said binding compound; 
a compartment comprising purified antigen; and/or 
instructions for use or disposal of reagents in said kit 



b) 
c) 



L ° 54. A method of producing an antigen: antibody complex, 

comprising contacting an antibody of Claim 51 and: 

a) a primate HCC5 polypeptide ; 

b) a primate Dubll polypeptide ,- 

c) a primate Dubl2 polypeptide ,- 
5 d) a primate MD-1 polypeptide; 

e) a primate MD-2 polypeptide; or 

f) a rodent MD-2 polypeptide; 
thereby allowing said complex to form. 

55. A composition comprising said binding compound of Claim 
51 and: 

1) a carrier, wherein said carrier is: 

a) an aqueous compound, including water, saline, and/or 
buffer; and/or 

b) formulated for oral, rectal, nasal, topical, or 
parenteral administration; or 

2) an antibody antagonist for another chemokine, including 
one selected from the group of HCC1, HCC2, HCC3, and 
HCC4 . 

56. An isolated or recombinant nucleic acid encoding a 
polypeptide or fusion protein of Claim 43, wherein: 
A) said HCC5 : 

a) polypeptide is from a primate, including a human; or 

b) nucleic acid: 
i) encodes an antigenic HCC5 peptide sequence of Table 

7; 
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ii) encodes a plurality of antigenic peptide sequences 
of Table 7; 

iii) exhibits identity over at least 25 nucleotides to 
a natural cDNA encoding said HCC5 segment ; 

iv) is a hybridization probe for a gene encoding said 
HCC5 polypeptide; or 

v) further encodes another chemokine. including one 

selected from the group of HCC1, HCC2 , HCC3 , and 
HCC4 ; 

) said Dubll: 

a) polypeptide is from a primate, including a human; or 

b) nucleic acid: 

i) encodes an antigenic Dubll peptide sequence of Table 

9; 

ii) encodes a plurality of antigenic peptide sequences 
of Table 9; 

iii) exhibits identity over at least 25 nucleotides to 
a natural cDNA encoding said Dubll segment; or 

iv) is a hybridization probe for a gene encoding said 
Dubll polypeptide ; 

said Dubl2: 

a) polypeptide i s f rom a primate, including a human; or 

b) nucleic acid: 

i) encodes an antigenic Dubl2 peptide sequence of Table 

9; 

ii) encodes a plurality of antigenic peptide sequences 
of Table 9; 

iii) exhibits identity over at least 25 nucleotides to 
a natural cDNA encoding said Dubl2 segment; 

iv) is a hybridization probe for a gene encoding said 
Dubl2 polypeptide; 

said primate MD-1: 

a) polypeptide is from a primate, including a human; or 

b) nucleic acid: 

i) encodes an antigenic MD-1 peptide sequence of Table 
11; 
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ii) encodes a plurality of antigenic peptide sequences 
of Table 11; 

iii) exhibits identity over at least 25 nucleotides to 
a natural cDNA encoding said MD-1 segment; 

iv) is a hybridization probe for a gene encoding said 
Dubll polypeptide; 

said primate MD-2 : 

a) polypeptide is from a human; or 

b) nucleic acid: 

i) encodes an antigenic MD-2 peptide sequence of Table 

11; 

ii) encodes a plurality of antigenic peptide sequences 
of Table 11; 

iii) exhibits identity over at least 25 nucleotides to 
a natural cDNA encoding said MD-2 segment; 

iv) is a hybridization probe for a gene encoding said 
primate MD-2 polypeptide; or 

said rodent MD-2 : 

a) polypeptide is from a mouse; or 

b) nucleic acid: 

i) encodes an antigenic MD-2 peptide sequence of Table 

11; 

ii) encodes a plurality of antigenic peptide sequences 
of Table 11 ,- 

iii) exhibits identity over at least 25 nucleotides to 
a natural cDNA encoding said MD-2 segment; or 

iv) is a hybridization probe for a gene encoding said 
rodent MD-2 polypeptide. 

57. The nucleic acid of Claim 56, which: 

a) is an expression vector; 

b) further comprises an origin of replication; 

c) is from a natural source ; 

d) comprises a detectable label; 

e) comprises synthetic nucleotide sequence; 

f) is less than 6 kb, preferably less than 3 kb; 

g) is from a primate, including a human ; 
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h) comprises a natural full length coding sequence; or 

i) is a PCR primer, PCR product, or mutagenesis primer. 

58. A cell or tissue comprising a recombinant nucleic acid 
of Claim 56, including wherein said cell is: 

a) a prokaryotic cell; 

b) a eukaryotic cell; 

c) a bacterial cell; 

d) a yeast cell; 

e) an insect cell,- 

f) a mammalian cell; 

g) a mouse cell; 

h) a primate cell; or 

i) a human cell. 



59. A kit comprising said nucleic acid of Claim 56, and: 
a) a compartment comprising said nucleic acid; 

a compartment comprising a nucleic acid encoding another 
chemokine, including HCC1, HCC2, HCC3 , and HCC4 ; or 
instructions for use or disposal of reagents in said kit. 



b) 
c) 



60. A nucleic acid which: 

a) hybridizes under wash conditions of 45° C and less than 
2M salt to the polypeptide coding portion of SEQ ID NO- 
24; 

hybridizes under wash conditions of 45° C and less than 
2M salt to the polypeptide coding portions of SEQ ID NO: 
31 or 33; 

hybridizes under wash conditions of 45° c and less than 
2M salt to the coding portions of SEQ ID NO: 35 or 37; 
hybridizes under wash conditions of 45° C and less than 
2M salt to the coding portion of SEQ ID NO: 41; 

e) hybridizes under wash conditions of 45° C and less than 

2M salt to the coding portion of SEQ ID NO: 43 or 45. or 

f) hybridizes under wash conditions of 45° C and less than 

2M salt to the coding portion of SEQ id NO: 47. 



b) 

c) 
d) 
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61. The nucleic acid of Claim 57, wherein: 

a) said wash conditions are at 55° C and/or 500 mM salt; or 

b) said wash conditions are at 65° C and/or 150 mM salt'. 

62. A method of modulating physiology or development of a 
cell or tissue culture cells comprising exposing said cell to an 
agonist or antagonist of HCC5, primate MD-1, primate MD-2 . or 
rodent MD-2. 

63. A method of detecting specific binding to a compound, 
comprising: 

a) contacting said compound to a composition selected from 
the group of: 

i) an antigen binding site which specifically binds to 

a HCC5 chemokine; 

ii) an antigen binding site which specifically binds to 
Dubll; 

iii) an antigen binding site which specifically binds 
to Dubl2; 

iv) an antigen binding site which specifically binds to 
primate MD-1; 

v) an antigen binding site which specifically binds to 

primate MD-2; 

vi) an antigen binding site which specifically binds to 
rodent MD-2; 

vii) an expression vector encoding a HCC5 chemokine or 
fragment thereof ,- 

viii) an expression vector encoding a Dubll or fragment 
thereof; 

ix) an expression vector encoding a Dubl2 or fragment 
thereof; 

x) an expression vector encoding a primate MD-1 or 

fragment thereof; 

xi) an expression vector encoding a primate MD-2 or 
fragment thereof; 

xii) an expression vector encoding a rodent MD-2 or 
fragment thereof; 
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xiii) a substantially pure protein which is 
specifically recognized by said antigen binding 
site of (i) ; 

xiv) a substantially pure protein which is specifically 
recognized by said antigen binding site of (ii); 

xiv) a substantially pure protein which is specifically 

recognized by said antigen binding site of <iii); 
xiv) a substantially pure protein which is specifically 

recognized by said antigen binding site of (iv) ; 
xiv) a substantially pure protein which is specifically 
recognized by said antigen binding site of (v) ; 
20 xiv) a substantially pure protein which is specifically 

recognized by said antigen binding site of (vi) ; 

ix) a substantially pure HCC5 chemokine or peptide 
thereof of Claim 43, or a fusion protein comprising 

25 a 30 amino acid sequence portion of HCC5 chemokine 

sequence; 

x) a substantially pure Dubll or peptide thereof of 
Claim 43, or a fusion protein comprising a 30 amino 
acid sequence portion of Dubll sequence; 

xi) a substantially pure Dubl2 or peptide thereof of 
Claim 43, or a fusion protein comprising a 30 amino 
acid sequence portion of Dubll sequence; 

35 xi) a substantially pure primate MD-1 or peptide 

thereof of Claim 43, or a fusion protein comprising 
a 30 amino acid sequence portion of primate MD-1 
sequence; 

xi) a substantially pure primate MD-2 or peptide 

thereof of Claim 43, or a fusion protein comprising 
a 30 amino acid sequence portion of primate MD-2 
sequence .- 

xi) a substantially pure rodent MD-2 or peptide thereof 

of Claim 43, or a fusion protein comprising a 30 

amino acid sequence portion of rodent MD-2 

35 sequence; and 

b) detecting binding of said compound to said composition 
50 • 

64. An isolated or recombinant polynucleotide which: 



20 

30 



25 
40 

30 

45 
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a> encodes at least 17 contiguous amino acid residues of SEQ 
ID NO: 54; 

b) encodes at least two distinct segments of at least 10 

contiguous amino acid residues of SEQ ID NO 54; or 

c) comprises one or more segments at least 21 contiguous 

nucleotides of SEQ ID NO: 53. 

65. A method of making: 

a) a polypeptide comprising expressing an expression vector 

of Claim 64, thereby producing said polypeptide,- 

b) a duplex nucleic acid comprising contacting a 

polynucleotide of Claim 64 with a complementary nucleic 
acid, thereby resulting in production of said duplex 
nucleic acid; 

c) a synthetic polynucleotide of Claim 64, comprising 

chemically polymerizing nucleotides to produce said 
polynucleotide; or 

d) a polynucleotide of Claim 64 comprising using a PCR 

method. 



66. An isolated or recombinant antigenic polypeptide 
comprising at least: 

a) one segment comprising at least 17 contiguous amino acids 

from SEQ ID NO: 54; or 

b) at least two distinct segments of at least 11 contiguous 

amino acids from SEQ ID NO: 54. 

67. The antigenic polypeptide of Claim 66, comprising at 
least one segment comprising at least 17 contiguous amino acids 
from SEQ ID NO: 54. 

68. The polypeptide of Claim 66, which exhibits at least two 
non-overlapping epitopes which are selective for primate protein 
Of SEQ ID NO: 54. 

69. The polypeptide of Claim 66, wherein said polypeptide: 
a) is a 5-fold or less substitution from a natural sequence ,- 

or 
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b) is a deletion or insertion variant from a natural 
sequence. 

70. A kit comprising said polypeptide of Claim 66, and 
instructions for the use or disposal of said polypeptide or other 
reagents of said kit. 

71. The antigenic polypeptide of Claim 66, comprising at 
least two distinct segments of. at least 11 contiguous amino acids 
from SEQ ID NO: 54. 

72. The polypeptide of Claim 71: 

a) which comprises at least one sequence from (SEQ ID NO: 

54) KESRYVHD (residues 120-127), DKHFEVLH (residues 127- 
134), HSDLEPQM (residues 134-141), QKDINKNM (residues 
177-184), YAPKLQEF (residues 203-210), SEEDILRM 
(residues 219-226), LRMELIIL (residues 224-231), 
ELCPVTII (residues 237-244), and LFLQVDAL (residues 249- 
256 ) ; and/or 

b) wherein said segments of at least 11 contiguous amino 

acids comprise one said segment with at least 14 
contiguous amino acids from SEQ ID NO: 54. 

73. The polypeptide of Claim 71, which exhibits at least two 
non-overlapping epitopes which are selective for primate protein 
of SEQ ID NO: 54 . 

74. The polypeptide of Claim 71, wherein said polypeptide: 
a) comprises a mature sequence of SEQ ID NO: 2 ,- 

binds with selectivity to an antibody generated against 

an itnmunogen of SEQ ID NO: 54; 
comprises a plurality of polypeptide segments of 17 
contiguous amino acids of SEQ ID NO: 54; or 
d) is a natural allelic variant of SEQ ID NO: 54. 

75. The polypeptide of Claim 71, wherein said polypeptide: 
a) is in a sterile composition; 



b) 
c) 



SUBSTITUTE SHEET ( rule 26 ) 



10 



15 



WO 00/01817 PCT/US99/12366 

139 

b) has a length at least 30 amino acids; 

c) is not glycosylated; 

d) is denatured; 

e) is a synthetic polypeptide; 

f) is attached to a solid substrate; or 

g) is a fusion protein with a detection or purification tag, 

including a FLAG, His6, or Ig sequence. 



76. The polypeptide of Claim 71, wherein said 

10 polypeptide: 

a) is a 5-fold or less substitution from a natural sequence; 
20 or 

b) is a deletion or insertion variant from a natural 

sequence . 

15 

25 7 7 . A kit comprising said polypeptide of Claim 71, and 

instructions for the use or disposal of said polypeptide or other 
reagents of said kit. 

30 20 78. A method using said polypeptide of Claim 71: 

a) to label said polypeptide, comprising labeling said 

polypeptide with a radioactive label; 

b) to separate said polypeptide from another polypeptide in 
35 a mixture, comprising running said mixture on a 

25 chromatography matrix, thereby separating said 

polypeptides; 

c) to identify a compound that binds selectively to said 
40 polypeptide, comprising incubating said compound with 

said polypeptide under appropriate conditions; thereby 
causing said component to bind to said polypeptide; or 

d) to conjugate said polypeptide to a matrix, comprising 
45 derivatizing said polypeptide with a reactive reagent, 

and conjugating said polypeptide to said matrix,- or 

e) inducing an antibody response to said polypeptide, 

35 comprising introducing said polypeptide as an antigen to 

311 immune system, thereby inducing said response. 

50 



30 
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79. A binding compound comprising an antigen binding portio 
from an antibody which binds with selectivity to a polypeptide of 
Claim 66. 



80. A method of evaluating the selectivity of binding of a 
compound to cyclin E2, comprising contacting said compound to a 
recombinant cyclin E2 polypeptide and at least one other cyclin; 
and comparing binding of said compound to said cyclins . 

81. The polypeptide of Claim 67: 

a) which comprises at least one sequence from (SEQ ID NO: 

54) KESRYVHD (residues 120-127), DKHFEVLH (residues 127- 
134), HSDLEPQM (residues 134-141), QKDINKNM (residues 
177-184), YAPKLQEF (residues 203-210), SEEDILRM 
(residues 219-226), LRMELIIL (residues 224-231), 
ELCPVTII (residues 237-244), and LFLQVDAL (residues 249- 
2 56) ; and/or 

b) wherein said segment comprising at least 17 contiguous 

amino acids exhibits at least 23 contiguous amino acids 
from SEQ XD NO: 54. 



82. The polypeptide of Claim 67, wherein said polypeptide: 

a) comprises a mature sequence of SEQ ID NO: 54; 

b) binds with selectivity to an antibody generated against 

an immunogen of SEQ ID NO: 54; 

c) comprises a plurality of polypeptide segments comprising 

at least 17 contiguous amino acids of SEQ ID NO: 54; or 

d) is a natural allelic variant of SEQ ID NO: 54. 



83. The polypeptide of Claim 67, wherein said polypeptide: 

a) is in a sterile composition; 

b) has a length at least 30 amino acids; 

c) is not glycosylated; 

d) is denatured; 

e) is a synthetic polypeptide; 

f ) is attached to a solid substrate; or 
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g) is a fusion protein with a detection or purification tag, 
including a FLAG, His6, or Ig sequence. 

84. A method using said polypeptide of Claim 67: 

a) to label said polypeptide, comprising labeling said 

polypeptide with a radioactive label; 

b) to separate said polypeptide from another polypeptide in 

a mixture, comprising running said mixture on a 
chromatography matrix, thereby separating said 
polypeptides ; 

c) to identify a compound that binds selectively to said 

polypeptide, comprising incubating said compound with 
said polypeptide under appropriate conditions; thereby- 
causing said component to bind to said polypeptide; 

d) to conjugate said polypeptide to a matrix, comprising 

derivatizing said polypeptide with a reactive reagent, 
and conjugating said polypeptide to said matrix; or 

e) inducing an antibody response to said polypeptide, 

comprising introducing said polypeptide as an antigen to 
an immune system, thereby inducing said response. 
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SEQUENCE LISTING 

<110> Schering Corporation 

<120> Mammalian Genes; Related Reagents and Methods 

<130> SF0819x(omnibusFF) 

<140> 
<141> 

<160> 56 

<170> Patent In Ver. 2.0 



<210> 1 
<211> 4064 
20 <212> DNA 

<213> Unknown 

<220> 
<221> CDS 
25 <222> (264) .. (2390) 

<220> 

<223> Description of Unknown Organism: primate 
30 <400> 1 

gtgaccaggg agacaaacac ttggagatac ttggggctga gtttgagcaa gactccctaa 60 

cctgtgtctg gacaagtctg atgtcctgtg tggcccaaga agaactgacc ccgtgtctgg 120 

35 agctcccacc gttattgcat ccctgctgtg gctcacctgc tgctgtctcc aggagcccct 180 

gagaagattt gcctcctctc ccctgctaag ctccaggtcc tgagattgaa ttaggggctg 240 

gagctcactg cactccagca gtc atg gga ccc agg ata ggg cca gcg ggt gag 293 
40 Met Gly Pro Arg lie Gly Pro Ala Gly Glu 

15 10 

gta ccc cag gta cca gac aag gaa acc aaa gcc aca atg ggc aca gaa 341 
Val Pro Gin Val Pro Asp Lys Glu Thr Lys Ala Thr Met Gly Thr Glu 
45 15 20 25 

aac aca cct gga ggc aaa gcc age cca gac cct cag gac gtg egg cca 3 89 

Asn Thr Pro Gly Gly Lys Ala Ser Pro Asp Pro Gin Asp Val Arg Pro 
30 35 40 

50 

agt gtg ttc cat aac ate aag ctg ttc gtt ctg tgc cac age ctg ctg 437 

Ser Val Phe His Aan lie Lys Leu Phe Val Leu Cys His Ser Leu Leu 
45 50 55 

55 cag ctg gcg cag etc atg ate tec ggc tac eta aag age tec ate tec 485 
Gin Leu Ala Gin Leu Met lie Ser Gly Tyr Leu Lys Ser Ser He Ser 
60 65 70 

aca gtg gag aag cgc ttc ggc etc tec age cag acg teg ggg czg ctg 533 
60 Thr Val Glu Lys Arg Phe Gly Leu Ser Ser Gin Thr Ser Gly Leu Leu 
75 80 85 90 
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gcc tec ttc aac gag gtg ggg aac aca gcc ttg att gtg tec gcg age 581 
5 Ala Ser Phe Asn Glu Val Gly Asn Thr Ala Leu He Val Phe Val Ser 
95 100 105 

tat ttt ggc age egg gtg cac cga ccc cga atg att ggc tat ggg get 629 
Tyr Phe Gly Ser Arg Val His Arg Pro Arg Met lie Gly Tyr Gly Ala 
10 110 115 120 

ate ctt gtg gcc ctg gcg ggc ctg etc atg act etc ccg cac ttc ate 677 

He Leu Val Ala Leu Ala Gly Leu Leu Met Thr Leu Pro His Phe He 
125 130 135 

15 

teg gag cca tac cgc tac gac aac acc age cct gag gat atg cca cag 725 

Ser Glu Pro Tyr Arg Tyr Asp Asn Thr Ser Pro Glu Asp Met Pro Gin 
140 145 150 

gac tec aag get tec ctg tgc ctg ccc aca acc teg gcc cca gcc teg 773 
Asp Phe Lys Ala Ser Leu Cys Leu Pro Thr Thr Ser Ala Pro Ala Ser 
155 160 165 170 

gcc ccc tec oot ggc aac tgc tea age cac aca gaa acc cag cat ctg 821 
25 Ala Pro Ser Asn Gly Asn Cys Ser Ser Tyr Thr Glu Thr Gin His Leu 
175 180 135 

age gtg gtg ggg ate atg ttc gtg gca cag acc ctg ctg ggc gtg ggc 869 
Ser Val Val Giy He Met Phe Val Ala Gin Thr Leu Leu Gly Val Gly 
30 190 195 200 

ggg gtg ccc att cag ccc ttt ggc ate tec tac ate gat gac ttt gcc 917 

Gly Val Pro lie Gin Pro Phe Gly He Ser Tyr He Asp Asp Phe Ala 
205 210 215 

35 

cac aac age aac teg ccc etc tac etc ggg ate ctg ttt gca gtg acc 965 

His Asn Ser Asn Ser Pro Leu Tyr Leu Gly He Leu Phe Ala Val Thr 

220 225 230 

40 atg atg ggg cca ggc czg gcc ttt ggg ctg ggc age etc atg ctg cgc 1013 
Met Met Gly Pro Gly Leu Ala Phe Gly Leu Gly Ser Leu Met Leu Arg 
235 240 245 250 

ctt tat gcg gac att aac cag atg cca gaa ggt ggt ate age ctg acc 1051 
45 Leu Tyr Val Asp He Asn Gin Met Pro Glu Gly Gly He Ser Leu Thr 
255 260 265 

ata aag gac ccc cga tgg gtg ggt gcc tgg tgg ctg ggt ttc etc ate 1109 
He Lys Asp Pro Arg Trp Val Gly Ala Tip Trp Leu Gly Phe Leu He 
50 270 275 280 

get gcc ggt gca gtg gcc ctg get gcc ate ccc tac ttc ttc ttc ccc 1157 
Ala Ala Gly Ala Val Ala Leu Ala Ala He Pro Tyr Phe Phe Phe Pro 
285 290 295 



55 



aag gaa atg ccc aag gaa aaa cgt gag ctt cag ttt egg cga aag gtc 1205 
Lys Glu Met Pro Lys Glu Lys Arg Glu Leu Gin Phe Arg Arg Lys Val 
300 305 310 



60 tta gca gtc aca gac tea cct gcc agg aag ggc aag gac tct ccc tct 1253 
Leu Ala Val Thr Asp Ser Pro Ala Arg Lys Gly Lys Asp Ser Pro Ser 
315 320 325 330 
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35 



40 



45 



55 



60 



aag cag age cct ggg gag tec acg aag aag cag got ggc eta gte cag 1301 
Lys Gin Ser Pro Gly Glu Ser Thr Lys Lys Gin Asp Gly Leu Val Gin 
335 340 345 



att gca cca aac etg act gtg ate cag ttc att aaa gtc ttc ccc agg 
lie Ala Pro Asn Leu Thr Val He Gin Phe He Lys Val Phe Pro Arg 
10 350 355 360 



1349 



gtg etg etg cag acc eta cgc cac ccc ate ttc ctg ctg gtg gtc ctg 1397 
Val Leu Leu Gin Thr Leu Arg His Pro He Phe Leu Leu Val Val Leu 
365 370 375 

tec cag gta tgc ttg tea tec atg get gcg ggc atg gec acc ttc ctg 1445 
Ser Gin Val Cys Leu Ser Ser Met Ala Ala Gly Met Ala Thr Phe Leu 
380 385 390 

ccc aag ttc ctg gag cgc cag ttt tec ate aca gec tec tac gec aac 1493 
Pro Lys Phe Leu Glu Arg Gin Phe Ser He Thr Ala Ser Tyr Ala Asn 
395 400 405 410 



ctg etc ate ggc tgc etc tec ttc cct teg gtc ate gtg ggc ate gtg 1541 
2b Leu Leu He Gly Cys Leu Ser Phe Pro Ser Val He Val Gly He Val 
415 420 



425 



gtg ggt ggc gtc ctg gtc aag egg etc cac ctg ggc cct gtg gga tgc 1589 
Val Gly Gly Val Leu Val Lys Arg Leu His Leu Gly Pro Val Gly Cys 

435 440 



30 430 



ggt gec etc tgc ctg ctg ggg atg ctg ctg tgc etc ttc ttc age ctg 1637 
Gly Ala Leu Cys Leu Leu Gly Met Leu Leu Cys Leu Phe Phe Ser Leu 
445 450 455 

ccg etc ttc ttt ate ggc tgc tec age cac cag att gcg ggc acc aca 1685 
Pro Leu Phe Phe He Gly Cys Ser Ser His Gin He Ala Gly He Thr 
460 465 470 

cac cag acc agt gec cac cct ggg ctg gag ctg tct cca age tgc atg 1733 
His Gin Thr Ser Ala His Pro Gly Leu Glu Leu Ser Pro Ser Cys Met 
475 480 485 " 490 

gag gec tgc tec tgc cca ttg gac ggc ttt aac cct gtc tgc gac ccc 1781 
Glu Ala Cys Ser Cys Pro Leu Asp Gly Phe Asn Pro Val Cys Asp Pro 
495 500 505 



age act cgt gtg gaa tac ate aca ccc tgc cac gca ggc tgc tea age 1829 
Ser Thr Arg Val Glu Tyr He Thr Pro Cys His Ala Gly Cys Ser Ser 
50 510 515 520 



tgg gtg gtc cag gat get ctg gac aac age cag gtt ttc tac acc aac 1877 
Trp Val Val Gin Asp Ala Leu Asp Asn Ser Gin Val Phe Tyr Thr Asn 
525 530 535 

tgc age tgc gtg gtg gag ggc aac ccc gtg ctg gca gga tec tgc gac 1925 
Cys Ser Cys Val Val Glu Gly Asn Pro Val Leu Ala Gly Ser Cys Asp 
540 545 550 

tea acg tgc age cat ctg gtg gtg ccc ttc ctg etc ctg gtc age ctg 1973 
Ser Thr Cys Ser His Leu Val Val Pro Phe Leu Leu Leu Val Ser Leu 
555 560 565 570 
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ggc teg gec ctg gec tgt etc acc cac aca ccc tec ttc atg etc ate 2021 

5 Gly Ser Ala Leu Ala Cys Leu Thr His Thr Pro Ser Phe Met Leu lie 

575 580 585 

eta aga gga gtg aag aaa gaa gac aag act ttg get gtg ggc ate cag 2069 

Leu Arg Gly Val Lys Lys Glu Asp Lys Thr Leu Ala Val Gly lie Gin 

10 590 595 600 



15 



20 



35 



ttc atg ttc ctg agg att ttg gec tgg atg ccc age ccc gtg ate cac 2117 
Phe Met Phe Leu Arg lie Leu Ala Tip Met Pro Ser Pro Val He His 
605 610 615 

ggc age gee ate gac acc acc tgt gtg cac tgg gec ctg age tgt ggg 2165 
Gly Ser Ala lie Asp Thr Thr Cys Val His Tip Ala Leu Ser Cys Gly 
620 625 630 

cgt cga get gtc tgt cgc tac tac aat aat gac ctg etc ego aac cga 2213 
Arg Arg Ala Val Cya Arg Tyr Tyr Asn Asn Asp Leu Leu Arg Asn Arg 
635 640 645 650 



ttc ate ggc etc cag ttc ttc ttc aaa aca ggt tct gtg ate tgc ttc 2?61 
25 Phe lie Gly Leu Gin Phe Phe Phe Lys Thr Gly Ser Val lie Cys Phe 
655 660 665 

gec tta gtt ttg get gtc ctg agg cag cag gac aaa gag gca agg acc 2309 
Ala Leu Val Leu Ala Val Leu Arg Gin Gin Asp Lys Glu Ala Arg Thr 
30 670 675 680 

aaa gag age aga tec age cct gec gta gag cag caa ttg eta gtg teg 2357 
Lys Glu Ser Arg Ser Ser Pro Ala Val Glu Gin Gin Leu Leu Val Ser 
685 690 695 



ggg cca ggg aag aag cca gag gat tec cga gtg tgagctgtct tggggcccca 2410 
Gly Pro Gly Lys Lys Pro Glu Asp Ser Arg Val 





700 




705 










40 


cctggccaag 


agtagcagee 


acagcagtac 


ctcctctgag 


tcctttgccc 


aagattgggt 


2470 




gtcaagagee 


ctgtgttcca 


ttctggctcc 


tccactaaat 


tgctgtgtga 


cttcaggcaa 


2530 


45 


gacattgatc 


ctctctcagc 


etttgettge 


tagtctgaac 


caaagagttg 


tttgggcatt 


2590 




tgctgtgttg 


gecatttctg 


gagcaagagg 


gtcttcttcc 


tccttccccc 


agccagccag 


2650 




ctgtcctggg 


gecaggcttt 


cctgggtgga 


aagaagtata 


cctttccctg 


gggecctagg 


2713 


50 


atagcaaagt 


gogecotagt 


gggecagget 


gccctccatg 


ctgggcccca 


gcccaggtct 


2770 




gcactcgcct 


ggatcacctt 


etttgagect 


tagccatctc 


ctgtcaggta 


ggaatgaact 


2830 


55 


tgccagcctt 


caggctcgct 


cagctatgac 


catctgtgcg 


gtcagggtac 


actcagctct 


2890 


cctccccaac 


tccagcagcc 


tttaagaagt 


gtccctttgg cgccccctggr 


aggcagagca 


2950 




ctgagctgga 


ccctgggtag 


actcccacag 


ggaggacgga 


gctggcctca 


ggagtgggac 


3010 


60 


acccagactt 


ggcagggect 


teaagaggee 


tgtgtggggg 


ccccaggaat 


ccttagctga 


3070 




ageggggaga 


ctcactctcc 


atctcaggaa 


attctagccc 


ttgccctcag 


ggagecaegg 


3130 




ttgagggtga 


ggcccaacac 


ctgecttagg 


gccctgggtg 


ggcaagtctg 


ggccctgggg 


3190 
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tagggaggga gactcaggcc cacacctggg tattttctaa tttcagacaa acacacactc 3250 
agcgcgcact cactgattcc tacacattgc caagatttca cacatgtgac caggggccac 3310 
caaagtccct gtgacctttg tgactaggat cctaatttct ctattttctc ctgggtgcct 3370 

10 gggtctgtgt cacctggggc agtgtggata atgtttagtt ctgtgacact gttttttggg 3430 
ggtggcacct ggttctccga tgcctgggct ggtgtcaggc ccaggactgt agtgctggga 3490 
gcagtaaagc tcagctctgt gtaatgagtg atgctatggc ttgctcgtgt cttatgatcc 3 5 50 
aatccttttc toeatcagcc cttgttttgt tttatggcta gccttatctg gcctggttat 3610 
ttccttgcgg ggaggagagg gtttgctaat ctgctcccag cccaacctat taccacccca 3670 

20 cctcgctggg acctactgct cgggaggcag cagacaggga gccaccagca gtggcttccr 3730 
ggccctgtgc tgggggtggg gggaagctgg gggcacatgt ggcccttgcc ttccgagcag 3790 

^ ctcccagtgc cagggctttg agactttccc acatgataaa agaaaaggga ggtacagaag 3850 
ttccaattcc ctttttaccc tgctggctgg tatctgtaaa tgtttaataa acatctgagc 3910 
atgtatctat caacgccaag aatttcaaag tctccttcaa caatatgagg cttttaggat 3970 

30 gtttatattc cttcatccct cttgtttccc aggctttgca gggaaaaaag tctggaatta 4030 
tagatacagc ttattattaa atttgttctt gcat 4064 

35 <210> 2 

<211> 709 
<212> PRT 
<213> Unknown 

40 <400> 2 

Mec Gly Pro Arg He Gly Pro Ala Gly Glu Val Pro Gin Val Pro Asp 
15 10 15 

Lys Glu Thr Lys Ala Thr Met Gly Thr Glu Asn Thr Pro Gly Gly Lys 
45 20 25 30 

Ala Ser Pro Asp Pro Gin Asp Val Arg Pro Ser Val Phe His Asn He 
35 40 45 

50 Lys Leu Phe Val Leu Cys His Ser Leu Leu Gin Leu Ala Gin Leu Met 
50 55 60 

He Ser Gly Tyr Leu Lys Ser Ser He Ser Thr Val Glu Lys Arg Phe 
65 70 75 80 



55 



Gly Leu Ser Ser Gin Thr Ser Gly Leu Leu Ala Ser Phe Asn Glu Val 
85 90 95 



Gly Asn Thr Ala Leu He Val Phe Val Ser Tyr Phe Gly Ser Arg Val 
60 100 105 110 

His Arg Pro Arg Met He Gly Tyr Gly Ala He Leu Val Ala Leu Ala 
115 120 125 
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5 Gly Leu Leu Mec Thr Leu Pro His Phe He Ser Glu Pro Tyr Arg Tyr 
130 135 140 

Asp Asn Thr Ser Pro Glu Asp Met Pro Gin Asp Phe Lys Ala Ser Leu 
145 150 155 160 

10 

Cys Leu Pro Thr Thr Ser Ala Pro Ala Ser Ala Pro Ser Asn Gly Asn 
165 170 175 

Cys Ser Ser Tyr Thr Glu Thr Gin His Leu Ser Val Val Gly He Met 
15 180 185 190 

Phe Val Ala Gin Thr Leu Leu Gly Val Gly Gly Val Pro He Gin Pro 
195 200 205 

20 Phe Gly He Ser Tyr He Asp Asp Phe Ala His Asn Ser Asn Ser Pro 
210 215 220 

Leu Tyr Leu Gly He Leu Phe Ala Val Thr Met Met Gly Pro Gly Leu 
225 230 235 240 

25 

Ala Phe Gly Leu Gly Ser Leu Met Leu Arg Leu Tyr Val Asp He Asn 
245 250 255 

Gin Met Pro Glu Gly Gly lie Ser Leu Thr He Lys Asp Pro Arg Trp 
30 260 265 270 

Val Gly Ala Trp Trp Leu Gly Phe Leu He Ala Ala Gly Ala Val Ala 
275 280 285 

35 Leu Ala Ala He Pro Tyr Phe Phe Phe Pro Lys Glu Met Pro Lys Glu 
290 295 300 

Lys Arg Glu Leu Gin Phe Arg Arg Lys Val Leu Ala Vai Thr Asp Ser 
305 310 315 320 

40 

Pro Ala Arg Lys Gly Lys Asp Ser Pro Ser Lys Gin Ser Pro Gly Glu 
325 330 335 

Ser Thr Lys Lys Gin Asp Gly Leu Val Gin He Ala Pro Asn Leu Thr 
45 340 345 350 

Val He Gin Phe He Lys Val Phe Pro Arg Val Leu Leu Gin Thr Leu 
355 360 365 

50 Arg His Pro He Phe Leu Leu Val Val Leu Ser Gin Val Cys Leu Ser 
370 375 380 

Ser Met Ala Ala Gly Met Ala Thr Phe Leu Pro Lys Phe Leu Glu Arg 
385 390 395 400 

55 

Gin Phe Ser He Thr Ala Ser Tyr Ala Asn. Leu Leu He Gly Cys Leu 
405 410 415 

Ser Phe Pro Ser Val He Val Gly He Val Val Gly Gly Val Leu Val 
60 420 425 430 

Lys Arg Leu His Leu Gly Pro Val Gly Cys Gly Ala Leu Cys Leu Leu 
435 440 445 
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Gly Met Leu Leu Cys Leu Phe Phe Ser Leu Pro Leu Phe Phe lie Gly 
5 450 455 460 

Cys Ser Ser His Gin lie Ala Gly lie Thr His Gin Thr Ser Ala His 
465 470 475 480 

10 Pro Gly Leu Glu Leu Ser Pro Ser Cys Met Glu Ala Cys Ser Cys Pro 
485 490 495 

Leu Asp Gly Phe Asn Pro Val Cys Asp Pro Ser Thr Arg Val Glu Tyr 
500 505 510 

15 

He Thr Pro Cys His Ala Gly Cys Ser Ser Trp Val Val Gin Asp Ala 
515 520 525 

Leu Asp Asn Ser Gin Val Phe Tyr Thr Asn Cys Ser Cys Val Val Glu 
20 530 535 540 

Gly Asn Pro Val Leu Ala Gly Ser Cys Asp Ser Thr Cys Ser His Leu 
545 550 555 56C 

25 Val Val Pro Phe Leu Leu Leu Val Ser Leu Gly Ser Ala Leu Ala Cys 
565 570 575 

Leu Thr His Thr Pro Ser Phe Met Leu He Leu Arg Gly Val Lys Lys 
580 585 590 

30 

Glu Asp Lys Thr Leu Ala Val Gly He Gin Phe Ket Phe Leu Arg He 
595 600 605 

Leu Ala Trp Met Pro Ser Pro Val He His Gly Ser Ala He Asp Thr 
35 610 615 620 

Thr Cys Val His Trp Ala Leu Ser Cys Gly Arg Arg Ala Val Cys Arg 
625 630 535 640 

40 Tyr Tyr Asn Asn Asp Leu Leu Arg Asn Arg Phe He Gly Leu Gin Phe 
645 650 655 



45 



Phe Phe Lys Thr Gly Ser Val lie Cys Phe Ala Leu Val Leu Ala Val 
660 665 670 

Leu Arg Gin Gin Asp Lys Glu Ala Arg Thr Lys Glu Ser Arg Ser Ser 
675 680 685 



Pro Ala Val Glu Gin Gin Leu Leu Val Ser Gly Pro Gly Lys Lys Pro 
50 690 695 700 



55 



60 



Glu Asp Ser Arg Val 
705 



<210> 3 
<211> 643 
<212> PRT 
<213> Unknown 



<220> 

<223> Description of Unknown Organism:primate 
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<400> 3 

5 Met Gly Leu Leu Pro Lys Leu Gly Val Ser Gin Gly Ser Asp Thr Ser 
15 10 15 

Thr Ser Arg Ala Gly Arg Cys Ala Arg Ser Val Phe Gly Asn lie Lys 
20 25 30 

10 

Val Phe Val Leu Cys Gin Gly Leu Leu Gin Leu Cys Gin Leu Leu Tyr 
35 40 45 

Ser Ala Tyr Phe Lys Ser Ser Leu Thr Thr He Glu Lys Arg Phe Gly 
15 50 55 60 

Leu Ser Ser Ser Ser Ser Gly Leu He Ser Ser Leu Asn Glu He Ser 
65 70 75 80 

20 Asn Ala lie Leu He He Phe Val Ser Tyr Phe Gly Ser Arg Val His 
85 90 95 

Arg Pro Arg Leu He Gly He Gly Gly Leu Phe Leu Ala Ala Gly Ala 
100 105 110 

25 

Phe He Leu Thr Leu Pro His Phe Leu Ser Glu Pro Tyr Gin Tyr Thr 
115 120 125 

Leu Ala Ser Thr Gly Asn Asn Ser Arg Leu Gin Ala Glu Leu Cys Gin 
30 1-30 135 140 

Lys His Trp Gin Asp Leu Pro Pro Ser Lys Cys His Ser Thr Thr Gin 
145 150 155 160 

35 Asn Pro Gin Lys Glu Thr Ser Ser Met Trp Gly Leu Met Val Val Ala 
165 170 175 

Gin Leu Leu Ala Gly He Gly Thr Val Pro He Gin Pro Phe Gly He 
160 185 190 

40 

Ser Tyr Val Asp Asp Phe Ser Glu Pro Ser Asn Ser Pro Leu Tyr He 
195 200 205 

Ser He Leu Phe Ala He Ser Val Phe Gly Pro Ala Phe Gly Tyr Leu 
45 210 215 220 

Leu Gly Ser He Met Leu Gin Ho Phe Val Asp Tyr Gly Arg Val Asn 
225 230 235 240 

50 Thr Ala Ala Val Asn Leu Val Pro Gly Asp Pro Arg Trp He Gly Ala 
245 250 255 

Trp Trp Leu Gly Leu Leu He Ser Ser Ala Leu Leu Val Leu Thr Ser 
250 265 270 

55 

Phe Pro Phe Phe Phe Phe Pro Arg Ala Met Pro He Gly Ala Lys Arg 
275 280 285 

Ala Pro Ala Thr Ala Asp Glu Ala Arg Lys Leu Glu Glu Ala Lys Ser 
60 290 295 300 

Arg Gly Ser Leu Val Asp Phe He Lys Arg Phe Pro Cys He Phe Leu 
305 310 315 320 
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Arg Leu Leu Met Asn Ser Leu Phe Val Leu Val Val Leu Ala Gin Cys 
5 325 330 335 

Thr Phe Ser Ser Val lie Ala C-ly Leu Ser Thr Phe Leu Asn Lys Phe 
340 345 350 

10 Leu Glu Lys Gin Tyr Gly Thr Ser Ala Ala Tyr Ala Asn Phe Leu lie 
355 360 365 

Gly Ala Val Asn Leu Pro Ala Ala Ala Leu Gly Met Leu Phe Gly Gly 
370 375 380 

15 

lie Leu Met Lys Arg Phe Val Phe Ser Leu Gin Thr lie Pro Arg lie 
385 390 395 400 

Ala Thr Thr lie lie Thr lie Ser Met lie Leu Cys Val Pro Leu Phe 
20 405 410 415 

Phe Met Gly Cys Ser Thr Pro Thr Val Ala Glu Val Tyr Pro Pro Ser 
420 425 430 

25 Thr Ser Ser Ser lie His Pro Gin Ser Pro Ala Cys Arg Arg Asp Cys 
435 440 445 

Ser Cys Pro Asp Ser lie Phe His Pro Val Cys Gly Asp Asn Gly lie 
450 455 460 

30 

Glu Tyr Leu Ser Pro Cys His Ala Gly Cys Ser Asn lie Asn Met Ser 
465 470 475 480 

Ser Ala Thr Ser Lys Gin Leu He Tyr Leu Asn Cys Ser Cys Val Thr 
35 485 490 495 

Gly Gly Ser Ala Ser Ala Lys Thr Gly Ser Cys Pro Val Pro Cys Ala 
500 505 510 

40 His Phe Leu Leu Pro Ala He Phe Leu He Ser Phe Val Ser Leu lie 
515 520 525 

Ala Cys lie Ser His Asn Pro Leu Tyr Met Met Val Leu Arg Val Val 
530 535 540 

45 

Asn Gin Glu Glu Lys Ser Phe Ala He Gly Val Gin Phe I.eu Leu Met 
545 550 555 560 

Arg Leu Leu Ala Trp Leu Pro Ser Pro Ala Leu Tyr Gly Leu Thr He 
50 565 570 575 

Asp His Ser Cys He Arg Trp Asn Ser Leu Cys Leu Gly Arg Arg Gly 
580 585 590 

55 Ala Cys Ala Tyr Tyr Asp Asn Asp Ala Leu Arg Asp Arg Tyr Leu Gly 
595 600 605 

Leu Gin Met Gly Tyr Lys Ala Leu Gly Met Leu Leu Leu Cys Phe He 
610 615 620 

60 

Ser Trp Arg Val Lys jys Asn Lys Glu Tyr Asn Val Gin Lys Ala Ala 
625 630 635 640 

Gly Leu He 
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5 <210> 4 

<211> 643 
<212> PRT 
<213> Unknown 

10 <220> 

<223> Description of Unknown Organism: rodent 
<400> 4 

Met Giy Leu Leu Leu Lys Pro Gly Ala Arg Gin Gly Ser Gly Thr Ser 
15 1 5 10 15 

Ser Val Pro Asp Arg Arg Cys Pro Arg Ser Val Phe Ser Asn lie Lys 
20 25 30 

20 Val Phe Val Leu Cys His Gly Leu Leu Gin Leu Cys Gin Leu Leu Tyr 
35 40 45 

Ser Ala Tyr Phe Lys Ser Ser Leu Thr Thr lie Glu Lys Arg Phe Gly 
50 55 SO 

25 

Leu Ser Ser Ser Ser Ser Gly Leu lie Ser Ser Leu Asn Glu He Ser 
65 70 75 80 

Asn Ala Thr Leu He He Phe He Ser Tyr Phe Gly Ser Arg Val Asn 
30 85 90 95 

Arg Pro Arg Met He Gly He Gly Gly Leu Leu Leu Ala Ala Gly Ala 
100 105 110 

35 Phe Val Leu Thr Leu Pro His Phe Leu Ser Glu Pro Tyr Gin Tyr Thr 
115 120 125 

Ser Thr Thr Asp Gly Asn Arg Ser Ser Phe Gin Thr Asp Leu Cys Gin 
130 135 140 

40 

Lys His Phe Gly Ala Leu Pro Pro Ser Lys Cys His Ser Thr Val Pro 
145 150 155 160 

Asp Thr His Lys Glu Thr Ser Ser Leu Trp Gly Leu Met Val Val Ala 
45 165 170 175 

Gin Leu Leu Ala Gly He Gly Thr Val Pro He Gin Pro Phe Gly He 
180 185 190 

50 Ser Tyr Val Asp Asp Phe Ala Glu Pro Thr Asn Ser Pro Leu Tyr He 
195 200 205 

Ser He Leu Phe Ala He Ala Val Phe Gly Pro Ala Phe Gly Tyr Leu 
210 215 220 

55 

Leu Gly Ser Val Met Leu Arg He Phe Val Asp Tyr Gly Arg Val Asp 
225 230 235 240 

Thr Ala Thr Val Asn Leu Ser Pro Gly Asp Pro Arg Trp He Gly Ala 
60 245 250 255 

Trp Trp Leu Gly Leu Leu He Ser Ser Gly Phe Leu He Vol Thr Ser 
260 265 270 
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Leu Pro Phe Phe Phe Phe Pro Arg Ala Met Ser Axg Gly Ala Glu Arg 
5 275 280 285 

Ser Val Thr Ala Glu Glu Thr Met Gin Thr Glu Glu Asp Lys Ser Arg 
290 295 300 

10 Gly Ser Leu Met Asp Phe lie Lys Arg Phe Pro Arg lie Phe Leu Arg 
305 310 315 320 

Leu Leu Met Asn Pro Leu Phe Met Leu Val Val Leu Ser Gin Cys Thr 
325 330 335 

Phe Ser Ser Val lie Ala Gly Leu Ser Thr Phe Leu Asn Lys Phe Leu 
340 345 350 

Glu Lys Gin Tyr Gly Ala Thr Ala Ala Tyr Ala Asn Phe Leu lie Gly 
20 355 360 365 

Ala Val Asn Leu Pro Ala Ala Ala Leu Gly Met Leu Phe Gly Glv He 
370 375 380 

25 Leu Met Lys Arg Phe Val Phe Pro Leu Gin Thr lie Pro Arg Val Ala 
385 390 395 400 

Ala Thr He lie Thr He Ser Met He Leu Cys Val Pro Leu Phe Phe 
405 410 415 

30 

Met Gly Cys Ser Thr Ser Ala Val Ala Glu Val Tyr Pro Pro Ser Thr 
420 425 430 

Ser Ser Ser He His Pro Gin Gin Pro Pro Ala Cys Arg Arg Asp Cys 
35 435 440 445 

Ser Cys Pro Asp Ser Phe Phe His Pro Val Cys Gly Asp Asn Gly Val 
450 455 460 

40 Glu Tyr Val Ser Pro Cys His Ala Gly Cys Ser Ser Thr Asn Thr Ser 
465 470 475 480 

Ser Glu Ala Ser Lys Glu Pro He Tyr Leu Asn Cys Ser Cys Val Ser 
485 490 495 

45 

Gly Gly Ser Ala Ser Gin Asp Arg Leu Met Pro His Val Leu Arg Ala 
500 505 510 

Leu Leu Leu Pro Ser He Phe Leu He Ser Phe Ala Ala Leu He Ala 
50 515 520 525 

Cys He Ser His Asn Pro Leu Tyr Met Met Val Leu Arg Val Val Asn 
530 535 540 

55 Gin Asp Glu Lys Ser Phe Ala He Gly Val Gin Phe Leu Leu Met Arg 
545 550 555 560 

Leu Leu Alo Trp Leu Pro Ala Pro Ser Leu Tyr Gly Leu Leu He Asp 
565 570 575 

60 

Ser Ser Cys Val Arg Trp Asn Tyr Leu Cys Ser Gly Arg Arg Gly Ala 
580 585 590 
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Cys Ala Tyr Tyr Asp Asn Asp Ala Leu Arg Asn Arg Tyr Leu Glv Leu 
5 595 600 605 

Gin Met Val Tyr Lys Ala Leu Gly Thr Leu Leu Leu Phe Phe He Ser 
610 615 S20 

10 Trp Arg- Mat Lys Lys Asn Arg Glu Tyr Ser Leu Gin Glu Asn Thr Ser 
625 €30 635 640 



15 



30 



40 



60 



Gly Leu He 



<210> 5 
<211> 1127 
<212> DNA 
20 <213> Unknown 

<220> 

<221> CDS 

<222> (99). .(999) 

25 



<220> 

<221> misc_dif ference 
<222> (367) 

<223> may be A; translation would be Asn 
<220> 

<223> Description of Unknown Organism :primate 
<400> 5 

35 cgcaggcgga ccgggggcaa aggaggtggc atgtcggtca ggcacagcag ggtcctgtgt 60 

ccgcgctgag ccgcgctctc cctgctccag caaggacc atg agg gcg ctg gag ggg 116 

Met Arg Ala Leu Glu Gly 
1 5 

cca ggc ctg teg ctg ctg tgc ctg gtg ttg gcg ctg cct gec ctg ctg 164 
Pro Gly Leu Ser Leu Leu Cys Leu Val Leu Ala Leu Pro Ala Leu Leu 
10 15 20 

45 ccg gtg ccg get gta cgc gga gtg gca gaa aca ccc acc tac ccc tgg 212 
Pro Val Pro Ala Val Arg Gly Val Ala Glu Thr Pro Thr Tyr Pro Trp 
25 30 35 

egg gac gca gag aca ggg gag egg ctg gtg tgc gec cag tgc ccc cca 260 
50 Arg Asp Ala Glu Thr Gly Glu Arg Leu Val Cys Ala Gin Cys Pro Pro 
40 45 50 

ggc acc ttt gtg cag egg ccg tgc cgc cga gac age ccc atg acg tgt 308 
Gly Thr Phe Val Gin Arg Pro Cys Arg Arg Asp Ser Pro Met Thr Cys 
55 55 60 65 70 

ggc ccg tgt cca ccg cgc cac tac acg cag ttc tgg aac tac ctg gag 356 
Gly Pro Cys Pro Pro Arg His Tyr Thr Gin Phe Trp Asn Tyr Leu Glu 
75 80 85 



cgc tgc cgc tac tgc tac gtc etc tgc ggg gag cgt gag gag gag gca 
Arg Cys Arg Tyr Cys Tyr Val Leu Cys Gly Glu Arg Glu Glu Glu Ala 
90 95 100 
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egg get Cgc cac gec acc cac aac ogt gec tgc cgc cgc cgc acc ggc 452 
5 Arg Ala Cys Hia Ala Thr His Asn Arg Ala Cys Arg Cys Arg Thr Gly 
105 110 115 

ttc ttc gcg cac get ggt ttc tgc ttg gag cac gca teg tgt cca cct 500 
Phe Phe Ala His Ala Gly Phe Cys Leu Glu His Ala Ser Cys Pro Pro 
10 120 125 130 

ggt gec ggc gtg att gec ccg ggc acc ccc age cag aac acg cag tgc 548 

Gly Ala Gly Val lie Ala Pro Gly Thr Pro Ser Gin Asn Thr Gin Cys 

135 140 145 150 

IS 

cag ccg tgc ccc cca ggc acc ttc tea gec age age tec age tea gag 596 

Gin Pro Cys Pro Pre Gly Thr Phe Ser Ala Ser Ser Ser Ser Ser Glu 

155 160 165 

20 cog tgc cag ccc cac cgc aac tgc acg gec ctg ggc ctg gec etc aat 644 
Gin Cys Gin Pro His Arg Asn Cys Thr Ala Leu Gly Leu Ala Leu Asn 
170 175 180 

gtg cca ggc tct tec tec cat gac acc ctg tgc acc age tgc acc ggc 692 
25 Val Pro Gly Ser Ser Ser His Asp Thr Leu Cys Thr Ser Cys Thr Gly 
185 190 195 

ttc ccc etc age acc agg gca cca gga get gag gag tgt gag cgt gec 740 
Phe Pro Leu Ser Thr Arg Val Pro Gly Ala Glu Glu Cys Glu Arg Ala 
30 200 205 210 

gtc ate gac ttt gtg get ttc cag gac ate tec ate aag agg ctg cag 788 

Val lie Asp Phe Val Ala Phe Gin Asp lie Ser lie Lys Arg Leu Gin 

215 220 225 230 

35 

egg ctg ctg cag gec etc gag gec ccg gag ggc tgg ggt ccg aca cca 836 

Arg Leu Leu Gin Ala Leu Glu Ala Pro Glu Gly Trp Gly Pro Thr Pro 

235 240 245 

agg gcg ggc cgc gcg gec ttg cag ctg aag ctg cgt egg egg etc acg 884 
Arg Ala Gly Arg Ala Ala Leu Gin Leu Lys Leu Arg Arg Arg Leu Thr 
250 255 260 

gag etc ctg ggg gcg cag gac ggg gcg ctg ctg gtg egg ctg ctg cag 932 
45 Glu Leu Leu Gly Ala Gin Asp Gly Ala Leu Leu Val Arg Leu Leu Gin 
265 270 275 

gcg ctg cgc gtg gec agg otg ccc ggg ctg gag egg age gtc cgt gag 980 
Ala Leu Arg Val Ala Arg Met Pro Gly Leu Glu Arg Ser Val Arg Glu 
50 280 285 290 

cgc ttc etc cct gtg cac tgatectggc cccctcttat ttattctaca 1028 
Arg Phe Leu Pro Val His 
295 300 



40 



55 



60 



tccttggcac cccacttgca ctgaaagagg ctttttttta aatagaagaa atgaggtttc 1088 
ttaaagctta tttttataaa gctttttcat aaaaaaaaaa aaaaaaaaa 1137 



<210> 6 
<211> 300 
<212> PRT 
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<213> Unknown 

5 

<400> 6 

Met Arg Ala Leu Glu Gly Pro Gly Leu Ser Leu Leu Cys Leu Val Leu 
1 5 10 15 

10 Ala Leu Pro Ala Leu Leu Pro Val Pro Ala Val Arg Gly Val Ala Glu 
20 25 30 

Thr Pro Thr Tyr Pro Trp Arg Asp Ala Glu Thr Gly Glu Arg Leu Val 
35 40 45 

15 

Cys Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg Pro Cys Arg Arg 
50 55 60 

Asp Ser Pro Met Thr Cys Gly Pro Cys Pro Pro Arg His Tyr Thr Gin 
20 65 70 75 80 

Phe Trp Asn Tyr Leu Glu Arg Cys Arg Tyr Cys Tyr Val Leu Cys Gly 
85 90 95 

25 Glu Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr His Asn Arg Ala 
100 105 110 

Cys Arg Cys Arg Thr Gly Phe Phe Ala His Ala Gly Phe Cys Leu Glu 
115 120 125 

30 

His Ala Ser Cys Pro Pro Gly Ala Gly Val He Ala Pro Gly Thr Pro 
130 135 140 

Ser Gin Asn Thr Gin Cys Gin Pro Cys Pro Pro Gly Thr Phe Ser Ala 
35 145 150 155 160 

Ser Ser Ser Ser Ser Glu Gin Cys Gin Pro His Arg Asn Cys Thr Ala 
1S5 170 175 

40 Leu Gly Leu Ala Leu Asn Val Pro Gly Ser Ser Ser His Asp Thr Leu 
180 185 190 

Cys Thr Ser Cys Thr Gly Phe Pro Leu Ser Thr Arg Val Pro Gly Ala 
195 200 205 

45 

Glu Glu Cys Glu Arg Ala Val lie Asp Phe Val Ala Phe Gin Asp lie 
210 215 220 

Ser He Lys Arg Leu Gin Arg Leu Leu Gin Ala Leu Glu Ala Pro Glu 
50 225 230 235 240 

Gly Trp Gly Pro Thr Pro Arg Ala Gly Arg Ala Ala Leu Gin Leu Lys 
245 250 255 

55 Leu Arg Arg Arg Leu Thr Glu Leu Leu Gly Ala Gin Asp Gly Ala Leu 
260 265 270 

Leu Val Arg Leu Leu Gin Ala Leu Arg Val Ala Arg Met Pro Gly Leu 
275 280 285 

60 

Glu Arg Ser Val Arg Glu Arg Phe Leu Pro Val His 
290 295 300 
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<21C> 1 

5 <211> 1031 

<212> DNA 

<213> Unknown 

<220> 
10 <221-- CDS 

<222> (402) .. (1031) 



15 



20 



60 



<220> 

<223> Description of Unknown Organisrrwprimate 
<400> 7 

ccgactcant ccctcgccga ccagtctggg cagcggagga gggtggttgg cagtggctgg 60 



aagcttcgct atgggaagtc gttcctttgc tctctcgcgc ccagtcctcc tccctggttc 120 

tcctcagccg ccgtcggagg agagcacccg gagacgcggg ctgcagtcgc ggcggcttct 180 

ccccgcctgg gcggccgcgc cgctgggcag gtgctgagcg cccctagagc ctcccttgcc 240 

25 gcctccctcc tctgcccggc cgcagcagtg cacatggggt gttggaggta gatgggctcc 300 

cggcccggga ggcggcggcg gacgcggcgc tgggcagaag cagccgccga ttccagctgc 3 60 

cccgcgcgcc ccgggcgccc ctgcgagtcc ccggttcagc c atg ggg acc tct ccg 416 
30 Met Gly Thr Ser Pro 

1 5 

age age age acc gec etc gec tec tgc age cgc ate gee cgc cga gec 464 
Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg lie Ala Arg Arg Ala 
35 10 15 2C 

aca gee acg atg ate gcg ggc tec ctt etc ctg ctt gga ttc ctt age 512 

Thr Ala Thr Met lie Ala Gly Ser Leu Leu Leu Leu Gly Phe Leu Ser 

25 30 35 

40 

acc acc aca get cag cca gaa cag aag gec teg aat etc att ggc aca 560 

Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser Asn Leu He Gly Thr 

40 45 SO 

45 tac cgc cat gtt gac cgt gec acc ggc cag gtg eta acc tgt gac aag 608 
Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val Leu Thr Cys Asp Lys 
55 60 65 

tgt cca gca gga acc tat gtc tct gag cat tgt acc aac aca age tgc 656 
50 Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys Thr Asn Thr Ser Cys 
70 75 80 85 

gcg tct ggc age agt tgc cct gtg ggg acc ttt acc agg cat gag aat 7 04 
Ala Ser Gly Ser ser Cys Pro Val Gly Thr Phe Thr Arg His Glu Asn 
55 90 95 100 

ggc ata gag aaa tgc cat gac tgt agt cag cca tgc cca tgg cca atg 7 52 
Gly He Glu Lys Cys His Asp Cys Ser Gin Pro Cys Pro Trp Pro Met 
105 110 115 



att gag aaa tta cct tgt get gec ttg act gac cga gaa tgc act tgc 800 
He Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp Arg Glu Cys Thr Cys 
120 125 130 
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cca cct ggc atg ttc cag tct aac get acc tgt gec ccc cat acg gtg 848 
5 Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys Ala Pro His Thr Val 
135 140 145 

tgt cct gtg ggt tgg ggt gtg egg aag aaa ggg aca gag act gag gat 896 
Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly Thr Glu Thr Glu Asp 
10 150 155 160 165 

gtg egg tgt aag cag tgt get egg ggg tac ttc tea gat gtg cct tct 944 
Val Arg Cys Lys Gin Cys Ala Arg Gly Tyr Phe Ser Asp Val Pro Ser 
170 175 180 

15 

agt gcg atg aac gca aag cat aca cag act gtc tgg ate aga acc tgg 992 
Ser Val Met Asn Ala Lys His Thr Gin Thr Val Trp lie Arg Thr Trp 
185 190 195 

20 ttg gtg ate aag ccg ggg gga cca agg aga cag aca act 1031 
Leu Val lie Lys Pro Gly Gly Pro Arg Arg Gin Thr Thr 
200 205 210 

25 <210> 8 

<211> 210 
<212> PRT 
<213> Unknown 

30 <400> 8 

Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 

He Ala Arg Arg Ala Thr Ala Thr Met He Ala Gly Ser Leu Leu Leu 
35 20 25 30 

Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
35 40 45 

40 Asn Leu He Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 

50 55 60 

Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys 
65 70 75 80 

45 

Thr Asn Thr Ser Cys Ala Ser Gly Ser Ser Cys Pro Val Gly Thr Phe 
85 90 95 

, Thr Arg His Glu Asn Gly He Glu Lys Cys His Asp Cys Ser Gin Pro 
50 100 105 110 

Cys Pro Trp Pro Met He Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 
115 120 125 

55 Arg Glu Cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys 
130 135 14C 

Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 
145 150 155 160 

60 

Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Tyr Phe 
165 170 175 
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Ser Asp Val Pro Ser Ser Val Met Asn Ala Lys His Thr Gin Thr Val 
5 180 185 190 

Tip lie Arg Thr Trp Leu Val lie Lys Pro Gly Gly Pro Arg Arg Gin 
195 200 205 

10 Thr Thr 
210 



<210> 9 

15 <211> 2877 

<212> DNA 

<213> Unknown 

<220> 

20 <221> CDS 

<222> (410) .. (2374) 



25 



30 



<220> 

<223> Description of Unknown Organism: primate 
<400> 9 



ggcacgagcc 


gactcagtcc 


ctcgccgacc 


agtctgggca 


gcggaggagg 


gtggttggca 


60 


gtggctggaa 


gcttcgctat 


gggaagtcgt 


Ccctttgctc 


tctcgcgccc 


agtcctcctc 


120 


cctggttctc 


cccagccgct 


gtcggaggag 


agcacccgga 


gacgcgggct 


gcagtcgcgg 


180 


cggcttctcc 


ccgcctgggc 


ggccgcgccg 


ctgggcaggt 


gctgagcgcc 


cctagcgcct 


240 


cccttgccgc 


ctccctcctc 


tgcccggccg 


cagcagtgca 


catggggtgt 


tggaggtaga 


300 


tgggctcccg 


gcccgggagg 


cggcggtgga 


tgcggcgctg 


ggcagaagca 


gccgccgatt 


360 


ccagctgccc 


cgcgcgcccc 


gggcccccct 


gcgagtcccc 


ggttcagcc atg ggg acc 


418 



40 Met Gly Thr 

1 

tct ccg age age age acc gec etc gee tec tgc age cgc ate gec cgc 466 

Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg lie Ala Arg 
45 5 10 15 

cga gec aca gec acg atg ate gcg ggc tec ctt etc ctg ctt gga ttc 514 

Arg Ala Thr Ala Thr Met lie Ala Gly Ser Leu Leu Leu Leu Gly Phe 
20 25 30 35 

50 

ctt age acc acc aca get cag cca gaa cag aag gee teg aat etc att 562 

Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser Asn Leu lie 
40 45 50 

55 ggc aca tac cgc cat gtt gac cgt gec acc ggc cag gtg eta acc tgt 610 

Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val Leu Thr Cys 
55 60 65 

gac aag tgt cca gca gga acc tat gtc tct gag cat tgt acc aac aca 658 

60 Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys Thr Asn Thr 
70 75 80 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



18 

age ctg cgc gtc tgc age agt tgc cct gtg ggg acc ttt acc agg cat 706 
5 Ser Leu Arg Val Cys Ser Ser Cys Pro Vol Gly Thr Phe Thr Arg His 
85 90 95 

gag aat ggc ata gag aaa tgc cat gac tgt agt cag cca tgc cca tgg 754 
Glu Asn Gly He Glu Lys Cys His Asp Cys Ser Gin Pro Cys Pro Trp 
10 100 105 110 115 

cca atg att gag aaa tta cct tgt get gec ttg act gac cga gaa tgc 802 

Pro Met He Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp Arg Glu Cys 

120 125 130 

15 

act tgc cca cct ggc stg ttc cag tct aac get acc tgt gec ccc cat 850 

Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys Ala Pro His 
135 140 145 

20 acg gtg tgt cct gtg ggt tgg ggt gtg egg aag aaa ggg aca gag act 898 
Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly Thr Glu Thr 
150 155 160 

gag gat gtg egg tgt aag cag tgt get egg ggc acc ttc tea gat gtg 946 
25 Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe Ser Asp Val 
165 170 175 

cct ~ct agt gtg atg aaa tgc aaa gca tac aca gac tgt ctg agt cag 994 
Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys Leu Ser Gin 
30 180 185 190 195 

aac ctg gtg gtg ate aag ccg ggg acc aag gag aca gac aac gtc tgt 1042 

Asn Leu Val Val lie Lys Pro Gly Thr Lys Glu Thr Asp Asn Val Cys 
200 205 210 

35 

ggc aca etc ccg tec ttc tec age tec acc tea cct tec cct ggc aca 1090 

Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser Pro Gly Thr 
215 220 225 

40 gec ate ttt cca cgc cct gag cac atg gaa acc cat gaa gtc cct tec 1138 
Ala He Phe Pro Arg Pro Glu His Met Glu Thr His Glu Val Pro Ser 
230 235 240 

tec act tat gtt ccc aaa ggc atg aac tea aca gaa tec aac tct tct 1186 
45 Ser Thr Tyr Val Pro Lys Gly Met Asn Ser Thr Glu Ser Asn Ser Ser 
245 250 255 

gec tct gtt aga cca aag gta ctg agt age ate cag gaa ggg aca gtc 1234 
Ala Ser Val Arg Pro Lys Val Leu Ser Ser He Gin Glu Gly Thr Val 
50 260 255 270 275 

cct gac aac aca age tea gca agg ggg aag gaa gac gtg aac aag acc 1282 
Pro Asp Asn Thr Ser Ser Ala Arg Gly Lys Glu Asp Val Asn Lys Thr 
280 285 290 



55 



etc cca aac ctt cag gta gtc aac cac cag caa ggc ccc cac cac aga 1330 
Leu Pro Asn Leu Gin Val Val Asn His Gin Gin Gly Pro His His Arg 
295 300 305 



60 cac ate ctg aag ctg ctg ccg tee atg gag gec act ggg ggc gag aag 1378 
His He Leu Lys Leu Leu Pro Ser Met Glu Ala Thr Gly Gly Glu Lys 
310 315 320 
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tec age acg ccc acc aag ggc ccc aag agg gga cat cct ego cag aac 1426 
5 Ser Ser Thr Pro lie Lys Gly Pro Lys Arg Gly His Pro Arg Gin Asn 
325 330 335 

eta cac aag cat ttt gac ate aat gag cat ttg ccc tgg atg att gtg 1474 
Leu His Lys His Phe Asp lie Asn Glu His Leu Pro Trp Met lie Val 
10 340 345 350 355 

ctt ttc ctg ctg ctg gtg ctt gtg gtg att gtg gtg tgc agt ate egg 1522 

Leu Phe Leu Leu Leu Val Leu Val Val He Val Val Cys Ser lie Arg 
360 365 370 

15 

aaa age ;cg agg act ctg aaa aag ggg ccc egg cag gat ccc agt gec 1570 

Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp Pro Ser Ala 
375 380 385 

20 att gtg gaa aag gca ggg ctg aag aaa tec atg act cca acc cag aac 1618 
lie Val Glu Lys Ala Gly Leu Lys Lys Ser Met Thr Pro Thr Gin Asn 
390 395 40C 

egg gag aaa tgg ate tac tac tgc aat ggc cat ggt ate gat ate ctg 1666 
25 Arg Glu Lys Trp lie Tyr Tyr Cys Asn Gly His Gly lie Asp lie Leu 
405 410 415 

aag ctt gta gca gec caa gtg gga age cag tgg aaa gat ate tat cag 1714 
Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp Lys Asp lie Tyr Gin 
30 420 425 430 435 

ttt ctt tgc aat gee agt gag agg gag gtt get get ttc tec aat ggg 17S2 

Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala Ala Phe Ser Asn Gly 
440 445 450 

35 

tac aca gee gac cac gag egg gec tac gca get ctg cag cac tgg acc 1810 

Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala Leu Gin His Trp Thr 
455 460 465 

40 ate egg ggc ccc gag gec age etc gee cag eta att age gec ctg cgc 1858 
He Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu lie Ser Ala Leu Arg 
470 475 480 

cag cac egg aga aac gat gtt gtg gag aag att cgt ggg ctg atg gaa 1906 
45 Gin His Arg Arg Asn Asp Val Val Glu Lys He Arg Gly Leu Met Glu 
485 490 495 

gac acc acc cag ctg gaa act gac aaa eta get etc ccg atg age ccc 1954 
Asp Thr Thr Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro Met Ser Pro 
50 500 505 510 515 



55 



age ccg ctt age ccg ago ccc ate ccc age ccc aac gcg aaa ctt gag 2002 
Ser Pro Leu Ser Pro Ser Pro He Pro Ser Pro Asn Ala Lys Leu Glu 
520 525 530 

aat tec get etc ctg acg gtg gag cct tec cca cag gac aag aac aag 2050 
Asn Ser Ala Leu Leu Thr Val Glu Pro Ser Pro Gin Asp Lys Asn Lys 
535 540 545 

60 ggc ttc ttc gtg gat gag teg gag ccc ctt etc cgc tgt gac tct aca 2098 
Gly Phe Phe Val Asp Glu Ser Glu Pro Leu Leu Arg Cys Asp Ser Thr 
550 555 560 
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tec age ggc. tec tec gcg ctg age agg aac ggt tec ttt att acc aaa 2146 
5 Ser Ser Gly Ser Ser Ala Leu Ser Arg Asn Gly Ser Phe lie Thr Lys 
565 570 575 

gaa aag aag gac aca gtg ttg egg cag gta cgc ctg gac ccc tgt gac 2194 
Glu Lys Lys Asp Thr Val Leu Arg Gin Val Arg Leu Asp Pro Cys Asp 
10 580 585 590 595 

ttg cag cct ate ttt gat gac atg etc cac ttt eta aat cct gag gag 2242 
Leu Gin Pro He Phe Asp Asp Met Leu His Phe Leu Asn Pro Glu Glu 
600 605 610 

15 

ctg egg gtg att gaa gag att ccc cag get gag gac aaa eta gac egg 2290 
Leu Arg Val He Glu Glu He Pro Gin Ala Glu Asp Lys Leu Asp Arg 
615 620 625 

20 eta ttc gaa att att gga gtc aag age cag gaa gec age cag acc etc 2338 
Leu Phe Glu He He Gly Val Lys Ser Gin Glu Ala Ser Gin Thr Leu 
530 635 640 

ctg gac tct gtc tat age cot ctt cct gac ctg ctg tagaacatag 23B4 
25 Leu Asp Ser Val Tyr Ser His Leu Pro Asp Leu Leu 
645 650 655 

ggataetgea rtctggaaat tactcaattt agtggcaggg tggtttttta atb:ccttct 2444 

30 gtgtctgatt tttgttgttt ggggtgtgtg tgtgtgtttg tgtgtgtgtg tgtgtgtgtg 2504 

tgtgtgtgtg tttaacagag aatatggcca gtgcttgagt tctttctcct tctctctctc 2564 

tctttttttt ttaaataact ctuctgggaa gttggtttat aagcctttgc caggtgtaac 2624 

tgttgtgaaa tacccaccac taaagttttc taagttccat attttctcca ttttgectte 2684 

ttatgtattt tcaagattat tctgtgcact ttaaatttac tcaacttacc ataaatgeag 2744 

40 tgtgactttt cccacacact ggattgtgag gctcttaact tcttaaaagt ataatggcat 2804 

cttgtgaate ctataagcag tctttatgtc tcttaacatt cacacctact ttttaaaaac 2864 

aaatattatt act 2877 

45 

<210> 10 
<211> 655 
<212> PRT 
50 <213> Unknown 

<400> 10 

Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 



35 



55 



He Ala Arg Arg Ala Thr Ala Thx Met He Ala Gly Ser Leu Leu Leu 
20 25 30 



Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
60 35 40 45 

Asn Leu He Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 
50 55 60 
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Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Vol Ser Glu His Cys 
5 65 70 75 80 

Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pro Val Gly Thr Phe 
R5 90 95 

10 Thr Arg His Glu Asn Gly He Glu Lys Cys His Asp Cys Ser Gin Pro 
100 105 110 

Cys Pro Trp Pro Met He Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 
115 120 125 

15 

Arg Glu cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys 
130 135 140 

Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 
20 145 150 155 160 

Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe 
165 170 175 

25 Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys 
180 185 190 

Leu Ser Gin Asn Leu Val Val He Lys Pro Gly Thr Lys Glu Thr Asp 
195 200 205 

30 

Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 220 

Pro Gly Thr Ala He Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
35 225 230 235 240 

Val Pro Ser Ser Thr Tyr Val Pro Lys Gly Met Asn Ser Thr Glu Ser 
245 250 255 

40 Asn Ser Ser Ala Ser Val Arg Pro Lys Val Leu Ser Ser He Gin Glu 
260 265 270 

Gly Thr Val Pro Asp Asn Thr Ser Ser Ala Arg Gly Lys Glu Asp Val 
275 280 285 

Asn Lys Thr Leu Pro Asn Leu Gin Val Val Asn His Gin Gin Gly Pro 
290 295 300 

His His Arg His He Leu Lys Leu Leu Pro Ser Met Glu Ala Thr Gly 
50 305 310 315 320 

Gly Glu Lys Ser Ser Thr Pro He Lys Gly Pro Lys Arg Gly His Pro 
325 330 335 

55 Arg Gin Asn Leu His Lys His Phe Asp He Asn Glu His Leu Pro Trp 
340 345 350 

Met He Val Leu Phe Leu Leu Leu Val Leu Val Val He Val Val Cys 
355 360 365 

60 

Ser He Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp 
370 375 380 
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Pro Ser Ala He Val Glu Lys Ala Gly Leu Lys Lys Ser Met Thr Pro 
5 385 390 395 400 

Thr Gin Asn Arg Glu Lys Trp lie Tyr Tyr Cys Asn Gly His Gly lie 
40S 410 415 

10 Asp lie Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp Lys Asp 
420 425 430 

He Tyr Gin Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala Ala Phe 
435 440 44S 

15 

Ser Asn Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala Leu Gin 
450 455 460 

His Trp Thr He Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu He Ser 
20 465 470 475 480 

Ala jeu Arg Gin His Arg Arg Asn Asp Val Val Glu Lys He Arg Gly 
485 490 495 

25 Leu Met Glu Asp Thr Thr Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro 
500 505 510 

Met Ser Pro Ser Pro Leu Ser Pro Ser Pro He Pro Ser Pro Asn Ala 
515 520 525 

30 

Lys Leu Glu Asn Ser Ala Leu Leu Thr Val Glu Pro Ser Pro Gin Asp 
530 535 540 

Lys Asn Lys Gly Phe Phe Val Asp Glu Ser Glu Pro Leu Leu Arg Cys 
35 545 550 555 560 

Asp Ser Thr Ser Ser Gly Ser Ser Ala Leu Ser Arg Asn Gly Ser Phe 
565 570 575 

40 He Thr Lys Glu Lys Lys Asp Thr Val Leu Arg Gin Val Arg Leu Asp 
580 585 590 

Pro Cys Asp Leu Gin Pro He Phe Asp Asp Met Leu His Phe Leu Asn 
595 600 605 

45 

Pro Glu Glu Leu Arg Val He Glu Glu He Pro Gin Ala Glu Asp Lys 
610 615 620 

Leu Asp Arg Leu Phe Glu He He Gly Val Lys Ser Gin Glu Ala Ser 
50 625 630 635 640 

Gin Thr Leu Leu Asp Ser Val Tyr Ser His Leu Pro Asp Leu Leu 
645 650 655 

55 

<210> 11 
<21L> 1474 
<212> DNA 
<213> Unknown 

60 

<220> 

<223> Description of Unknown Organ ism: primate 
<220> 



SUBSTITUTE SHEET ( rule 26 ) 
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23 

<221> CDS 
5 <222> CD . . (1332) 

<400> 11 

atg ggg acc tct ccg age age age acc gec etc gec tec tgc age cgc 48 

Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
10 1 5 10 IS 

ate gec cgc cga gec aca gec acg atg ate gcg ggc tec ctt etc ctg 96 

He Ala Arg Arg Ala Thr Ala Thr Met lie Ala Gly Ser Leu Leu Leu 
20 25 30 

15 

ctt gga ttc ctt age acc acc aca get cag cca gaa cag aag gee teg 144 

Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
35 40 45 

20 oat etc att ggc aca tac cgc cat gtt gac cgt gec acc ggc cag gtg 192 
Asn Leu He Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 
50 55 60 

eta acc tgt gac aag cgt cca gca gga acc tat gtc tct gag cat tgt 240 
25 Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys 
65 70 75 90 

acc aac aca age ctg cgc gtc tgc age agt tgc cct gtg ggg acc ttt 288 
Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pre Val Gly Thr Phe 
30 85 • 90 95 

acc agg cat gag aat ggc ata gag aaa tgc cat gac tgt agt cag cca 336 
Thr Arg His Glu Asn Gly He Glu Lys Cys His Asp Cys Ser Gin Pro 
100 105 110 



35 



55 



tgc cca tgg cca atg att gag aaa tta cct tgt get gec ttg act gac 384 
Cys Pro Trp Pro Met He Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 
115 120 125 



40 cga gaa tgc act tgc cca cct ggc atg ttc cag tct aac get acc tgt 432 
Arg Glu Cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys 
130 135 140 

gec ccc cat acg gtg tgt cct gtg ggt tgg ggt gtg egg aag aaa ggg 480 
45 Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 
145 150 155 160 

aca gag act gag gat gtg egg tgt aag cag tgt get egg ggt acc ttc 528 
Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe 
50 165 170 175 

tea gat gtg cct tct agt gtg atg aaa tgc aaa gca tac aca gac tgt 576 
Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys 
180 185 190 



ctg agt cag aac ctg gtg gtg ate aag ccg ggg acc aag gag aca gac 624 
Leu Ser Gin Asn Leu Val Val He Lys Pro Gly Thr Lys Glu Thr Asp 
195 200 205 



60 aac gtc tgt ggc aca etc ccg tec ttc tec age tec acc tea cct tec 672 
Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 220 
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cct ggc aca gcc ate ttt cca cgc cct gag cac atg gaa acc cat gaa 720 
5 Pro Gly Thr Ala lie Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
225 230 235 240 

gtc cct tec tec act tat gtt ccc aaa ggc atg aac tea aca gaa tec 768 
Val Pro Ser Ser Thr Tyr Val Pro Lys Gly Met Asn Ser Thr Glu Ser 
10 245 250 255 

aac tec tct gcc tct gtt aga cca aag gta ctg age age ate cag gaa 816 
Asn Ser Ser Ala Ser Val Arg Pro Lys Val Leu Ser Ser Tie Gin Glu 
260 265 270 

15 

ggg aco gtc cct gac aac aca age tea gca agg ggg aag gaa gac gtg 864 
Gly Thr Val Pro Asp Asn Thr Ser Ser Ala Arg Gly Lys Glu Asp Val 
275 280 285 

20 aac aag acc etc cca aac ctt cag gta gtc aac cac cag caa ggc ccc 912 
Asn Lys Thr Leu Pro Asn Leu Gin Val Val Asn His Gin Gin Gly Pro 
290 295 300 

cac cac aga cac ate ctg aag ctg ctg ccg tec atg gag gcc act ggg 960 
25 His His Arg His lie Leu Lys Leu Leu Pro Ser Met Glu Ala Thr Gly 
305 310 315 320 

ggc gag aag tec age acg ccc ate: aag ggc ccc aac agg gga cat cct 1008 
Gly Glu Lys Ser Ser Thr Pro He Lys Gly Pro Lys Arg Gly His Pro 
30 325 330 335 

aga cag aac eta cac aag cat ttt gac ate aat gag cat ttg ccc tgg 1056 

Arg Gin Asn Leu His Lys His Phe Asp He Asn Glu His Leu Pro Tip 
340 345 350 

35 

atg att gtg ctt ttc ctg ctg ctg gtg ctt gtg gtg att gtg gtg tgc 1104 

Met He Val Leu Phe Leu Leu Leu Val Leu Val Val lie Val Val Cys 
355 360 365 

40 agt ate egg aaa age teg agg act ctg aaa aag ggg ccc egg cag gat 1152 
Ser He Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp 
370 375 380 

ccc agt gcc att gtg gaa aag gca ggg ctg aag aaa tec atg act cca 1200 
45 Pro Ser Ala He Val Glu Lys Ala Gly Leu Lys Lys Ser Met Thr Pro 
385 390 395 400 

acc cag aac egg gag aaa tgg ate tac tac tgc aat ggc cat gga ccc 1248 
Thr Gin Asn Arg Glu Lys Trp He Tyr Tyr Cys Asn Gly His Gly Pro 
50 405 410 415 

cat gat gag gag tgg ggg ttg atg gag aga cat att caa gat att tat 1296 

His Asp Glu Glu Trp Gly Leu Met Glu Arg His He Gin Asp He Tyr 
420 425 430 

55 

att caa aga age aat caa gat tea gaa aga tgg ggt tgataatttt 1342 

He Gin Arg Ser Asn Gin Asp Ser Glu Arg Trp Gly 

435 440 

60 tacttcaccc tgggaggcag catagtgcag tgaaaggtat cgatatcctg aaccttgtag 1402 

cagcccaagt gggaagccag tggaaagata tctatcagtt tetttgeaat gecagtgaga 1462 

gggaggttgc tg 1474 
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5 

<210> 12 
<211> 444 
<212> PRT 
<213> Unknown 

10 

<400> 12 

Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 

15 lie Ala Arg Arg Ala ?hr Ala Thr Met He Ala Gly Ser Leu Leu Leu 
20 25 30 

Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
35 40 45 

20 

Asn Leu lie Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 
50 55 60 

Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys 
25 65 70 75 80 

Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pro Val Gly Thr Phe 
85 90 95 

30 Thr Arg His Glu Asn Gly lie Glu Lys Cys His Asp Cys Ser Gin Pro 
100 105 110 

Cys Pro Trp Pro Met lie Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 
115 120 125 

35 

Arg Glu Cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys 
130 135 140 

Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 
40 145 150 155 160 

Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe 
165 170 175 

45 Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys 
180 185 190 

Leu Ser Gin Asn Leu Val Val lie Lys Pro Gly Thr Lys Glu Thr Asp 
195 200 205 

50 

Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 220 

Pro Gly Thr Ala He Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
55 225 230 235 240 

Val Pro Ser Ser Thr Tyr Val Pro Lys Gly Met Asn Ser Thr Glu Ser 
245 250 255 

60 Asn Ser Ser Ala Ser Val Arg Pro Lys Val Leu Ser Ser He Gin Glu 
260 265 270 

Gly Thr Val Pro Asp Asn Thr Ser Ser Ala Arg Gly Lys Glu Asp Val 
275 280 285 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



26 



5 Asn Lys Thr Leu Pro Asn Leu Gin Val Val Asn His Gin Gin Gly Pro 
290 295 300 

His His Arg His He Leu Lys Leu Leu Pro Ser Met Glu Ala Thr Gly 
305 310 315 320 

10 

Gly Glu Lys Ser Ser Thr Pro lie Lys Sly Pro Lys Arg Gly His Pro 
325 330 335 

Arg Gin Asn Leu His Lys His Phe Asp lie Asn Glu His Leu Pro Trp 
15 34C 345 350 

Met lie Val Leu Phe Leu Leu Leu Val Leu Val Val He Val Val Cys 
355 360 365 

20 Ser lie Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp 
370 375 380 



25 



Pro Ser Ala lie Val Glu Lys Ala Gly Leu Lys Lys Ser Met Thr Pro 

385 390 395 400 

Thr Gin Asn Arg Glu Lys Trp He Tyr Tyr Cys Asn Gly His Gly Pro 

405 410 415 



40 



His Asp Glu Glu Trp Gly Leu Met Glu Arg His He Gin Asp He Tyr 
30 420 425 430 

He Gin Arg Ser Asn Gin Asp Ser Glu Arg Trp Gly 
435 440 

35 

<210> 13 
<211> 227 
<212> PRT 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: rodent 
<400> 13 

45 Met Ala Pro Ala Ala Leu Trp Val Ala Leu Val Phe Glu Leu Gin Leu 
15 10 15 

Trp Ala Thr Gly His Thr Val Pro Ala Gin Val Val Leu Thr Pro Tyr 
20 25 30 

50 

Lys Pro Glu Pro Gly Tyr Glu Cys Gin He Ser Gin Glu Tyr Tyr Asp 
35 40 45 

Arg Lys Ala Gin Met Cys Cys Ala Lys Cys Pro Pro Gly Gin Tyr Val 
55 50 55 60 

Lys His Phe Cys Asn Lys Thr Ser Asp Thr Val Cys Ala Asp Cys Glu 
65 70 75 80 

60 Ala Ser Met Tyr Thr Gin Val Trp Asn Gin Phe Arg Thr Cys Leu Ser 
85 90 95 

Cys Ser Ser Ser Cys Thr Thr Asp Gin Val Glu He Arg Ala Cys Thr 
100 105 110 
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5 Lys Gin Gin Asn Arg Val Cys Ala Cys Glu Alo Gly Arg Tyr Cys Ala 
115 120 125 

Leu Lys Thr His Ser Gly Ser Cys Arg Gin Cys Met Arg Leu Ser Lye 
130 135 140 

10 

Cys Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Asn Gly Asn 
145 150 155 160 

Val Leu Cys Lys Ala Cys Ala Pro Gly Thr Phe Ser Asp Thr Thr Ser 
15 165 170 175 

Ser Thr Asp Val Cys Arg Pro His Arg He Cys Ser He Leu Ala He 
180 185 190 

20 Pro Gly Asn Ala Ser Thr Asp Ala Val Cys Ala Pro Glu Ser Pro Thr 
195 200 205 

Leu Ser Ala He Pro Arg Thr Leu Tyr Val Ser Gin Pro Glu Pro Thr 
210 215 220 

25 

Arg Ser Gin 
225 



30 <210> 14 
<211> 225 
<212> PRT 
<213> Unknown 

35 <220> 

<223> Description of Unknown Organism: primate 
<400> 14 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
40 1 5 10 15 

Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 
20 25 30 

45 Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin 
35 40 45 

Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly Gin His Ala Lys 
50 55 60 

50 

Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 
65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
55 85 90 95 

Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arg 
100 105 110 

60 Glu Gin Asn Arg He Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 140 
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5 Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 
145 150 155 160 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 
165 170 175 

10 

Asp lie Cys Arg Pro His Gin lie Cys Asn Val Val Ala lie Pro Gly 
180 185 190 

Asn Ala Ser Met Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
IS 195 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
210 215 220 

20 Gin 
225 



<210> 15 
25 <211> 187 
<212> PRT 
<213> Unknown 

<220> 

30 <223> Description of Unknown Organ ism: primate 
<400> 15 

Met Asn Lys Leu Leu Cys Cys Ala Leu Val Phe Leu Asp lie Ser lie 
1 5 10 15 

35 

Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr 
40 35 40 45 

Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 
50 55 60 

45 Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu 
85 90 95 

50 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr 
100 105 110 

Leu Glu lie Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe 
55 115 120 125 

Gly Val Val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 

60 Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys 
165 170 175 
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5 Gly Asn Ala Thr His Asp Asn lie Cys Ser Gly 
180 185 



<210> 16 
10 <211> 636 
<212> DNA 
<213> Unknown 

<220> 

15 <223> Description of Unknown Organism: rodent 

<220> 
<221> CDS 
<222> (104) . . (553) 



20 



40 



<400> 16 

ggcacgaggg cgtttggcgc ggaagtgcta ccaagctgcg gaaagcgtga gtctggagca 60 



cagcactggc gagtagcagg aatoaococg tttggtgaga gcc atg gca etc aag 115 
25 Met Ala Leu Lys 

i 

gtc eta cct eta cac agg acg gtg etc ttc get gcc att etc ttc eta 163 
Val Leu Pro Leu His Arg Thr Val Leu Phe Ala Ala lie Leu Phe Leu 
30 5 10 15 20 

etc cac ctg gca tgt aaa gtg agt tgc gaa acc gga gat tgc agg cag 211 

Leu His Leu Ala Cys Lys Val Ser Cys Glu Thr Gly Asp Cys Arg Gin 

25 30 35 

35 

cag gaa ttc aag gat cga tct gga aac tgt gtc etc tgc aaa cag tgc 259 

Gin Glu Phe Lys Asp Arg Ser Gly Asn Cys Val Leu Cys Lys Gin Cys 
40 45 50 

gga cct ggc atg gag ttg tec aag gaa tgt ggc ttc ggc tat egg gag 307 
Gly Pro Gly Met Glu Leu Ser Lys Glu Cys Gly Phe Gly Tyr Gly Glu 
55 60 65 

gat gca cag tgt gtg ccc tgc agg ccg cac egg ttc aag gaa gac tgg 355 
45 Asp Ala Gin Cys Val Pro Cys Arg Pro His Arg Phe Lys Glu Asp Trp 
70 75 80 

ggt ttc cag aag tgt aag cca tgt gcg gac tgt gcg ctg gtg aac cgc 403 
Gly Phe Gin Lys Cys Lys Pro Cys Ala Asp Cys Ala Leu Val Asn Arg 
50 85 90 95 100 

ttt cag agg gcc aac tgc tea cac acc agt got get gtc tgc ggg gac 451 

Phe Gin Arg Ala Asn Cys Ser His Thr Ser Asp Ala Val Cys Gly Asp 

105 110 115 

55 

tgc ctg cca gga ttt tac egg aag ace aaa ctg gtt ggt ttt caa gac 499 

Cys Leu Pro Gly Phe Tyr Arg Lys Thr Lys Leu Val Gly Phe Gin Asp 

120 125 130 

60 atg gag tgt gtg ccc tgc gga gac cca cct cct ccc tac gaa cca cac 547 
Met Glu Cys Val Pro Cys Gly Asp Pro Pro Pro Pro Tyr Glu Pro His 
135 140 145 
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tgt gag tgatgtgcca agtggcagca gacctttaaa aaaaaaagaa aaaaaaacaa 603 
5 Cys Glu 
150 

acaaaaacaa aaaaaaaaaa aaaaaaaaaa aaa 636 

10 

<210> 17 
<211> 150 
<212> PRT 
< 2 1 3 >■ Unknown 

15 

<400> 17 

Met Ala Leu Lys Val Leu Pro Leu His Arg Thr Val Leu Phe Ala Ala 
1 5 10 15 

20 He Leu Phe Leu Leu His Leu Ala Cys Lys Val Ser Cys Glu Thr Gly 
20 25 30 

Asp Cys Arg Gin Gin Glu Phe Lys Asp Arg Ser Gly Asn Cys Val Leu 
35 40 45 

25 

Cys Lys Gin Cys Gly Pro Gly Met Glu Leu Ser Lys Glu Cys Gly Phe 
50 55 60 

Gly Tyr Cly Glu Asp Ala Gin Cys Val Pio Cys Arg Pro His Arg Phe 
30 65 70 75 80 

Lys Glu Asp Trp Gly Phe Gin Lys Cys Lys Pro Cys Ala Asp Cvs Ala 
85 90 " 95 

35 Leu Val Asn Arg Phe Gin Arg Ala Asn Cys Ser His Thr Ser Asp Ala 
100 105 110 

Val Cys Gly Asp Cys Leu Pro Gly Phe Tyr Arg Lys Thr Lys Leu Val 
115 120 125 

40 

Gly Phe Gin Asp Mec Glu Cys Val Pro Cys Gly Asp Pro Pro Pro Pro 
130 135 140 

Tyr Glu Pro His Cys Glu 
45 145 150 



<2io> ie 

<211> 474 

50 <212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: primate 

55 

<220> 

<221> CDS 

<222> (78! . . (473) 

60 <400> 18 



cgcgctgagg tggatttgta ccggagtccc atttgggagc aagagccatc tactcgtccg 60 
ttaccggcct tcccacc atg gat tgc caa gaa aat gag tac tgg gac caa 110 
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Met Asp Cys Gin Glu Asn Glu Tyr Trp Asp Gin 
5 1 5 10 

tgg ?ga egg tgt gtc acc tgc caa egg tgt ggt cct gga cag gag eta 158 

Trp Gly Arg Cys Val Thr Cys Gin Arg Cys Gly Pro Gly Gin Glu Leu 
15 20 25 

10 

tec aag cat tgt ggt tat gga gag ggt gga gat gec tac tgc aca gec 206 

Ser Lys Asp Cys Gly Tyr Gly Glu Gly Gly Asp Ala Tyr Cys Thr Ala 
30 35 40 

15 tgc cct cct cgc agt aca aaa gca get ggg gec acc aca aat gtc aga 254 
Cys Pro Pro Arg Ser Thr Lys Ala Ala Gly Ala Thr Thr Asn Val Arg 
45 50 55 

gtt gca tea cct gtg ctg tea tea ate gtg ttc aga agg tec aac tgc 302 
2 0 Val Ala Ser Pro Val Leu Ser Ser lie Val Phe Arg Arg Phe Asn Cys 
60 65 70 75 

aca gtn acc tct nat get gtc tg: ggg gga ngg ttt gec caa gtt tct 350 
Thr Xaa Thr Ser Xaa Ala Val Cys Gly Gly Xaa Phe Ala Gin Val Ser 
25 80 85 90 



aac cga aag aca cgc cat tgg aag 
Asn Arg Lys Thr Arg His Trp Lys 
95 

30 

ccg tgg cac aaa gnc aga ccc cca 
Pro Trp His Lys Xaa Arg Pro Pro 
110 115 

35 



get gec agg acc aag gat ggc ate 398 
Ala Ala Arg Thr Lys Asp Gly lie 
100 105 

act tct gan ggt tnc aaa gtg net 446 
Thr Ser Xaa Gly Xaa Lys Val Xaa 
12D 



ttc caa ttg gag ctt aat ggg agg can a 474 
Phe Gin Leu Glu Leu Asn Gly Arg Xaa 
125 130 



40 <210> IS 
<211> 132 
<212> PUT 
<213> Unknown 

45 <400> 19 

Met Asp Cys Gin Glu Asn Glu Tyr Trp Asp Gin Trp Gly Arg Cys Val 
15 10 15 

Thr Cys Gin Arg Cys Gly Pro Gly Gin Glu Leu Ser Lys Asp Cys Gly 
50 20 25 30 

Tyr Gly Glu Gly Gly Asp Ala Tyr Cys Thr Ala Cys Pro Pro Arg Ser 
35 40 45 

55 Thr Lys Ala Ala Gly Ala Thr Thr Asn Val Arg Val Ala Ser Pro Val 
50 55 60 

Leu Ser Ser lie Val phe Arg Arg Phe Asn Cys Thr Xaa Thr Ser Xaa 
65 70 75 80 

60 

Ala Val Cys Gly Gly Xaa Phe Ala Gin Val Ser Asn Arg Lys Thr Arg 
85 90 95 
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His Trp Lys Ala Ala Arg Thr Lys Asp Gly lie Pro Trp His Lys Xaa 
5 100 105 110 

Arg Pro Pro Thr Ser Xaa Gly Xaa Lys Val Xaa Phe Gin Leu Glu Leu 
115 120 125 

10 Asn Gly Arg Xaa 
130 



<210> 20 

15 <211> 546 

<212> DNA 

<213> Unknown 

<220> 

20 <223> Description of Unknown Organism: primate 

<220> 

<221> CDS 

<222> (78).. (308) 



25 



<400> 20 

cgcgctgagg tggatttgta ccggagtccc atttgggagc aagagccatc tactcgtccg 60 



ttnccggcct tcccacc atg gat tgc caa gaa aat gag tac tgg gac caa 110 
30 Met Asp Cys Gin Glu Asn Glu Tyr Trp Asp Gin 

15 10 

tgg gga egg tgt gtc acc tgc cao egg tgt ggt cct gga cag gag eta 158 
Trp Gly Arg Cys Val Thr Cys Glr. Arg Cys Gly Pro Gly Gin Glu Leu 
35 15 20 25 

tec aag gat tgt ggt tat gga gag ggt gga gat gee tac tgc aca gee 206 
Ser Lys Asp Cys Gly Tyr Gly Glu Gly Gly Asp Ala Tyr Cys Thr Ala 
30 35 40 

40 

tgc cct cct cgc agg tac aaa age age tgg ggc cac cac aaa tgt cag 254 
Cys Pro Pro Arg Arg Tyr Lys Ser Ser Trp Gly His His Lys Cys Gin 
45 50 55 

45 agt tgc ate acc tgt get gtc ate aat cgt gtt cag aag gtc caa ctg 302 
Ser Cys lie Thr Cys Ala Val lie Asn Arg Val Gin Lys Val Gin Leu 
60 65 70 75 

cac age taacctctna tgctgtctgt ggggatgttt gncccaagtt ctnaccgaaa 358 
50 His Ser 

agacacgcca tgggaaggct ggenggacca ngaatggccn tcccgtggca gaaagecaga 418 

ccccccaacn nctgnaggtt ccaatgtggc ettnecattt ggaagcttan tgggaaggca 478 

55 

gatgncaacc caaagtggcc ccttcaggga ggecaaaatt tgttggcaat gggtgnagca 538 

gentgeca 546 
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10 



<210> 21 
<211> 77 
<212> PRT 
<213> Unknown 
<400> 21 



Met Asp Cys Gin Glu Asn Glu Tyr Trp Asp Gin Trp Gly Arg Cys Val 
IS 10 IS 



Thr Cys Gin Arg Cys Gly Pro Gly Gin Glu Leu Ser Lys Asp Cvs Gly 
15 20 25 30 

Tyr Gly Glu Gly Gly Asp Ala Tyr Cys Thr Ala Cys Pro Pro Arg Arg 
35 40 45 

20 Tyr Lys Ser Ser Trp Gly His His Lys Cys Gin Ser Cys lie Thr Cys 
50 55 60 



Ala Val lie Asn Arg Val Gin Lys Val Gin Leu His Ser 
70 75 



65 




<210> 


22 


<211> 


932 


<212> 


DNA 


<213> 


Unknown 


<220> 




<223> 


Description of 


<220> 




<221> 


CDS 


<222> 


(78) . . (770) 


<220> 




<221> 


misc_f eature 


<222> 


(782) 


<223> 


nucleotide may 


<400> 


22 



25 



30 



35 



40 

<222> (782) 

C, G, or T 

45 cgcgctgagg tggatttgta ccggagtccc atttgggagc aagagccatc tactcgtccg 60 

ttaccggcct tcccacc atg gat tgc caa gaa aat gag tac tgg gac caa 110 
Met Asp Cys Gin Glu Asn Glu Tyr Trp Asp Gin 
15 10 

50 

tgg gga egg tgt gtc acc tgc caa egg tgt ggt cct gga cag gag eta 158 
Trp Gly Arg Cys Val Thr Cys Gin Arg Cys Gly Pro Gly Gin Glu Leu 
15 20 25 

55 tec aag gat tgt ggt tat gga gag ggt gga gat gec tac tgc aca gee 206 
Ser Lys Asp Cys Gly Tyr Gly Glu Gly Gly Asp Ala Tyr Cys Thr Ala 
30 35 40 

tgc cct cct cgc agg tac aaa age age tgg ggc cac cac aaa tgt cag 254 
60 Cys Pro Pro Arg Arg Tyr Lys Ser Ser Trp Gly His His Lys Cys Gin 
45 50 55 
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agt tgc ate ace tgt get gtc ate aat cgt gtt cag aag gtc aac tgc 302 

5 Ser Cys lie Thr Cys Ala Vol He Asn Arg Val Gin Lys Val Asn Cys 
60 65 70 75 

aca get acc tct aat get gtc tgt ggg gac tgt ttg ccc agg ttc tac 350 
Thr Ala Thr Ser Asn Ala Val Cys Gly Asp Cys Leu Pro Arg Phe Tyr 
10 80 85 90 

cga aag aca cgc att gga ggc ctg cag gac caa gag tgc ate ccg tgc 398 

Arg Lys Thr Arg He Gly Gly Leu Gin Asp Gin Glu Cys He Pro Cys 
95 100 105 

15 

acg aag cag acc ccc acc tct gag gtt caa tgt gec ttc cag ttg age 446 

Thr Lys Gin Thr Pro Thr Ser Glu Val Gin Cys Ala Phe Gin Leu Ser 

110 115 120 

20 tta gtg gag gca gat gca ccc aca gtg ccc cct cag gag gec aca ctt 494 
Leu Val Glu Ala Asp Ala Pro Thr Val Pro Pro Gin Glu Ala Thr Leu 
125 130 135 

gtt gca ctg gtg age age ctg eta gtg gtg ttt acc ctg gec ttc ctg 542 
25 Val Ala Leu Vol Ser Ser Leu Leu Val Val Phe Thr Leu Ala Phe Leu 
140 145 150 155 

ggg etc ttc ttc etc tac tgc aag cag ttc ttc aac aga cat tgc cag 590 
Gly Leu Phe Phe Leu Tyr Cys Lys Gin Phe Phe Asn Arg His Cys Gin 
30 160 165 170 

cgt gga ggt ttg ctg cag ttt gag get gat aaa aca gca aag gag gaa 638 

Arg Gly Gly Leu Leu Gin Phe Glu Ala Asp Lys Thr Ala Lys Glu Glu 

175 180 185 

35 

tct etc ttc ccc gtg cca ccc age aag gag acc agt get gag tec caa 686 

Ser Leu Phe Pro Val Pro Pro Ser Lys Glu Thr Ser Ala Glu Ser Gin 
190 195 200 

40 gtc tct tgg gec cct ggc age ctt gec cag ttg ttc tct ctg gac tct 734 
Val Ser Trp Ala Pro Gly Ser Leu Ala Gin Leu Phe Ser Leu Asp Ser 
205 210 215 

gtt cct ata cca caa cag cag cag ggg cct gaa atg tgatgtccac 780 
45 Val Pro He Pro Gin Gin Gin Gin Gly Pro Glu Met 
220 225 230 

angagctaat accctacaga tggggcatat cctatcccat cccaccagag gattgattct 840 

50 ccatttcaca aggactgatc tggagcattt cttgcttccc tgttgtagtc tggggageca 900 

gattccacat tcatgggact accagacatg tt 932 

55 <210> 23 

<211> 231 

<212> PRT 

<213> Unknown 

60 <400> 23 

Met Asp Cys Gin Glu Asn Glu Tyr Trp Asp Gin Trp Gly Arg Cys Val 
15 10 15 
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5 Thr Cys Gin Arg Cys Gly Pro Gly Gin Glu Leu Ser Lys Asp Cys Gly 
20 25 30 

Tyr Gly Glu Gly Gly Asp Ala Tyr Cys Thr Ala Cys Pro Pro Arg Arg 
35 40 45 

10 

Tyr Lys Ser Ser Trp Gly His His Lys Cys Gin Ser Cy s He Thr Cys 
50 55 60 

Ala Val lie Asn Arg Val Gin Lys Val Asn Cys Thr Ala Thr Ser Asn 
15 65 70 75 80 

Ala Val Cys Gly Asp Cys Leu Pre Arg Phe Tyr Arg Lys Thr Arg lie 
85 90 95 

20 Gly Gly Leu Gin Asp Gin Glu Cys lie Pro Cys Thr Lys Gin Thr Pro 
100 105 110 

Thr Ser Glu Val Glr. Cys Ala Phe Gin Leu Ser Leu Val Glu Ala Asp 
115 120 125 

25 

Ala Pro Thr Val Pre Pro Gin Glu Ala Thr Leu Val Ala Leu Val Ser 
130 135 140 

Ser Leu Leu Val Val Phe Thr Leu Ala Phe Leu Gly Leu Phe Phe Leu 
30 145 150 155 160 

Tyr Cys Lys Gin Phe Phe Asn Arg His Cys Gin Arg Gly Gly Leu Leu 
165 170 175 

35 Gin Phe Glu Ala Asp Lys Thr Ala Lys Glu Glu Ser Leu Phe Pro Val 
180 185 190 

Pro Pro Ser Lys Glu Thr Ser Ala Glu Ser Gin Val Ser Trp Ala Pro 
195 200 205 

40 

Gly Ser Leu Ala Gin Leu Phe Ser Leu Asp Ser Val Pro He Pro Gin 
210 215 220 

Gin Gin Gin Gly Pro Glu Met 
45 225 230 



<210> 24 

<211> 232 

50 <212> DNA 

<213> Unknown 

<220> 

<223> Description o£ Unknown Organism: primate 

55 

<400> 24 

ttggccttag ggaccaagct tttatcatcg tcagtgggoc ttaocctgtc ttaaaagtgc 60 
tgcttctcct acactcgctc aagatcccga gtcagctgta ttatggcatc ctattagtca 120 

60 

ggcagcctgt gcttcaagcc cgtagttgta ttcatcccct aaaggggeca ttccgtttgt 180 
atcatcacat gtcotcagtg ggtccatgtg tatatcaagg acatgatgea ga 232 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/0I8I7 



PCT/US99/12366 



<210> 25 36 

<211> 77 

<212> PRT 

< 2 1 3 > Unknown 

<220> 



10 <223> Description of Unknown Organism: primate 
<400> 25 

Leu Ala Leu Gly Thr T.ys Leu Leu Ser Ser Ser Val Gly Leu Asn Leu 
15 10 15 

Ser Xaa Lys Cys Cys Phe Ser Tyr Thr Arg Ser Arg Ser Arg Val Ser 
20 25 30 



15 



Cys lie Met Ala Ser Tyr Xaa Ser Gly Ser Leu Cys Phe Lys Pro Val 
20 35 40 45 

Val Val Phe lie Pro Xaa Arg Gly His Ser Val Cys lie lie Thr Cys 
50 55 60 

25 Pro Gin Trp Val His Val Tyr He Lys Asp Met Met Gin 
65 70 75 



<210> 26 

3 0 <211> 72 

<212> PRT 

<213> Unknown 

<220> 

35 <223> Description of Unknown Organismrpriiaate 
<400> 26 

Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro Ser Glu Cys 
15 10 15 



40 



Cys Phe Thr Tyr Thr Thr Tyr Lys He Pro Arg Gin Arg He Met Asp 
20 25 30 



Tyr Tyr Glu Thr Asn Ser Gin Cys Ser Lys Pro Gly lie Val Phe He 

45 35 40 45 

Thr Xaa Arg Gly His Ser Val Cys Thr Asn Pro Ser Asp Lys Trp Val 

50 55 60 

50 Gin Asp Tyr He Lys Asp Met Lys 

65 70 



<210> 27 
55 <211> 143 
<212> PRT 
<213> Unknown 



<220> 

60 <223> Description of Unknown Organism: primate 
<400> 27 

Met Lys lie Ser Val Ala Ala He Pro Phe Phe Leu Leu He Thr He 
15 10 15 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



37 

Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro 
5 20 25 30 

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys He Pro Arg Gin Arg 
35 40 45 

10 He Met Asp Tyr Tyr Glu Thr Asn Ser Gin Cys Ser Lys Pro Gly He 
50 E5 60 

Val Phe He Thr Lys Arg Gly His Ser Val Cys Thr Asn Pro Ser Asp 
65 70 75 80 

15 

Lys Trp Val Gin Asp Tyr He Lys Asp Met Lys Glu Asn Thr Lys Thr 
85 90 95 

Glu Ser Ser Ser Arg Gly Pro Tyr His Pro Ser Glu Cys Cys Phe Thr 
20 100 105 110 

Tyr Thr Thr Tyr Lys He Pro Arg Gin Arg He Met Asp Tyr Tyr Glu 
H5 120 125 

25 Thr Asn Ser Gin Cys Ser Lys Pro Gly He Val Phe He Thr Xaa 
130 135 140 



<210> 28 
30 <211> 93 
<212> PRT 
<213> Unknown 

<220> 

35 <223> Description of Unknown Organism: primate 
<400> 28 

Met Lys He Ser Val Ala Ala He Pro Phe Phe Leu Leu He Thr He 
15 10 15 

40 

Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro 
20 25 30 

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys He Pro Arg Gin Arg 
45 35 40 45 

He Met Asp Tyr Tyr Glu Thr Asn Ser Gin Cys Ser Lys Pro Gly He 
50 55 60 

50 Val Phe He Thr Lys Arg Gly His Ser Val Cys Thr Asn Pro Ser Asp 
65 70 75 80 

Lys Trp Val Gin Asp Tyr He Lys Asp Met Lys Glu Asn 
85 90 

55 

<210> 29 
<211> 93 
<212> PRT 
60 <213> Unknown 

<220> 

<223> Description of Unknown Organism: primate 
<400> 29 
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Met Lys lie Ser Val Ala Ala lie Pro Phe Phe Leu Leu lie Thr lie 
5 1 5 10 15 

Ala lieu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro 
20 25 30 

10 Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys He Pro Arg Gin Arg 
35 40 45 



15 



He Met Asp Tyr Tyr Glu Thr Asn Ser Gin Cys Ser Lys Pro Gly He 
50 55 60 

Val Phe He Thr Lys Arg Gly His Ser Val Cys Thr Asn Pro Ser Asp 

65 70 75 80 



Lys Trp Val Gin Asp Tyr He Lys Asp Met Lys Glu Asn 
20 85 90 



<210> 30 
<211> 93 
25 <212> PRT 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: primate 

30 

<400> 30 

Met Lys He Ser Val Ala Ala He Pro Phe Phe Leu Leu He Thr He 
15 10 15 

35 Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro 
20 25 30 

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys He Pro Arg Gin Arg 
35 40 45 

40 

He Met Asp Tyr Tyr Glu Thr Asn Ser Gin Cys Ser Lys Pro Gly He 
50 55 60 

Val Phe He Thr Lys Arg Gly His Ser Val Cys Thr Asn Pro Ser Asp 
45 65 70 75 80 

Lys Trp Val Gin Asp Tyr He Lys Asp Met Lys Glu Asn 
85 90 

50 

<210> 31 
<211> 1082 
<212> DNA 
<213> Unknown 

55 

<220> 

<223> Description of Unknown Organism: primate 

<220> 
60 <221> CDS 

<222> (1) . . (1080) 

<220> 

<221> raisc_feature 
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<222> (20) 
5 <223> nucleoside may be G 

<220> 

<221> misc_feature 
<222> (56) 

10 <223> nucleotide may be A, C, G or T 
<220> 

<221> inisc_f eature 
<222> (103) 
15 <223> nucleotide may be A, C, G, or T 

<220> 

<221> misc_f eature 
<222> (130) 
20 <223> nucleotide may be C or T 

<220> 

<221> misc_feature 
<222> (190) 
25 <223> nucleotide may be A or C 

<220> 

<221> misc_feature 
<222> (256) 
30 <223> nucleotide may be C or G 

<400> 31 

atg cct ttc ccr. ggc cca cac gca ggt aga tct tec act eta aag gac 43 
Met Pro Phe Pro Gly Pro His Ala Gly Arg Ser Ser Thr Leu Lys Asp 
35 1 5 10 15 

acc acc cct cca tec cac caa ata ttt gga agg etc ctg gaa gat etc 96 

Thr Thr Pro Pro Ser His Gin lie Phe Gly Arg Leu Leu Glu Asp Leu 

20 25 30 

40 

caa ate caa gtg tct ccc act gee cac ggc att cca gac act ttt gac 144 

Gin He Gin Val Ser Pro Thr Ala His Gly He Pro Asp Thr Phe Asp 
35 40 45 

45 cct tac ctg gac ate gec ctg gat ate cag gca get cag agt gtc cag 192 
Pro Tyr Leu Asp He Ala Leu Asp He Gin Ala Ala Gin Ser Val Gin 
50 55 60 

caa get ttg gaa cag ttg gtg aag ccc gaa gaa etc aat gga gag aat 240 
50 Gin Ala Leu Glu Gin Leu Val Lys Pro Glu Glu Leu Asn Gly Glu Asn 
65 70 75 80 

gec tat cat tgt ggt ctt tgt etc cag agg gcg ccg gec tec aag acg 288 
Ala Tyr His Cys Gly Leu Cys Leu Gin Arg Ala Pro Ala Ser Lys Thr 
55 85 90 95 

tta act tta cac acc tct gec aag gtc etc ate ctt gtc ttg aag aga 336 
Leu Thr Leu His Thr Ser Ala Lys Val Leu He Leu Val Leu Lys Arg 
100 105 110 



60 



ttc tec gat gtc aca ggc aac aag att gec aag aat gtg caa tat cct 384 
Phe Ser Asp Val Thx Gly Asn Lys He Ala Lys Asn Val Gin Tyr Pro 
115 120 125 
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gag tgc ctt gac atg cag cca tac atg tct cag cag aac aca gga cct 432 

5 Glu Cys Leu Asp Met Gin Pro Tyr Met Ser Gin Gin Asn Thr Giy Pro 
130 135 140 

ctt gcc tat gtc etc tat get gtg ctg gtc cac get ggg tgg agt tgt 480 

Leu Val Tyr Val Leu Tyr Ala Val Leu Val His Ala Gly Trp Ser Cys 
10 145 150 155 160 



15 



20 



40 



cac aac gga cat tac ttc tct tat gtc aaa get caa gaa ggc cag tgg 528 

His Asn Gly His Tyr Phe Ser Tyr Val Lys Ala Gin Glu Gly Gin Trp 

165 170 175 

tat aaa ate gat gat gcc gag gtc acc gcc tct age ate act tct gtc 575 

Tyr Lys Met Asp Asp Ala Glu Val Thr Ala Ser Ser lie Thr Ser Val 
180 185 190 

ctg agt caa cag gcc tac gtc etc ttt tac a-c cag aag agt gaa tgg 624 

Leu ser Gin Gin Ala Tyr Val Leu Phe Tyr He Gin Lys Ser Glu Trp 
195 200 205 



gaa aga cac agt gag agt gtg tea aga ggc agg gaa cca aga gcc ctt 672 
25 Glu Arg His Ser Glu Ser Val Ser Arg Gly Arg Glu Pro Arg Ala Leu 
210 215 220 

ggc gca gaa gac aca gac agg cga gca acg caa gga gag etc aag aga 720 
Gly Ala Glu Asp Thr Asp Arg Arg Ala Thr Gin Gly Glu Leu Lys Arg 
30 225 230 235 240 

gac cac ccc tgc etc cag gcc ccc gag ttg gac gag cac ttg gtg gaa 768 

Asp His Pro Cys Leu Gin Ala Pro Glu Leu Asp Glu His Leu Val Glu 
245 250 255 

35 

aga gcc act cag gaa age acc tta gac cac tgg aaa ttc ctt caa gag 816 

Arg Ala Thr Gin Glu Ser Thr Leu Asp His Trp Lys Phe Leu Gin Glu 
260 265 270 

caa aac aaa acg aag cct gag ttc aac gtc aga aaa gtc gaa ggt acc 864 
Gin Asn Lys Thr Lys Pro Glu Phe Asn Val Arg Lys Val Glu Gly Thr 
275 280 285 

ctg cct ccc gac gta ctt gtg att cat caa tea aaa tac aag tgt ggg 912 
45 Leu Pro Pro Asp Val Leu Val He His Gin Ser Lys Tyr Lys Cys Gly 
290 295 300 

atg aag aac cat cat cct gaa cag caa age tec ctg eta aac etc tct 960 
Met Lys Asn Kis His Pro Glu Gin Gin Ser Ser Leu Leu Asn Leu Ser 
50 305 310 315 32C 

teg acg acc ccg aca cat cag gag tec atg aac act ggc aca etc get 1008 
Ser Thr Thr Pro Thr His Gin Glu Ser Met Asn Thr Gly Thr Leu Ala 
325 330 335 

55 

tec ctg cga ggg agg gcc agg aga tec aaa ggg aag aac aaa cac age 1056 
Ser Leu Arg Gly Arg Ala Arg Arg Ser Lys Gly Lys Asn Lys His Ser 
340 345 350 

60 aag agg get ctg ctt gtg tgc cag tg 1082 
Lys Arg Ala Leu Leu Val Cys Gin 
355 360 
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<210> 32 
5 <211> 360 
<212> PRT 
<213> Unknown 

<400> 32 

10 Met Pro Phe Pro Gly Pro His Ala Gly Arg Ser Ser Thr Leu Lys Asp 
1 5 10 15 

Thr Thr Pro Pro Ser His Gin lie Phe Gly Arg Leu Leu Glu Asp Leu 
20 25 30 

15 

Gin lie Gin Val Ser Pro Thr Ala His Gly lie Pro Asp Thr Phe Asp 
35 40 45 

Pro Tyr Leu Asp lie Ala Leu Asp lie Gin Ala Ala Gin Ser Val Gin 
20 50 55 60 

Gin Ala Leu Glu Gin Leu Val Lys Pro Glu Glu Leu Asn Gly Glu Asn 
65 70 75 80 

25 Ala Tyr His Cys Gly Leu Cys Leu Gin Arg Ala Pro Ala Ser Lys Thr 
85 90 95 

Leu Thr Leu His Thr Ser Ala Lys Val Leu lie Leu Val Leu Lys Arg 
100 105 110 

30 

Phe Ser Asp Val Thr Gly Asn Lys lie Ala Lys Asn Val Gin Tyr Pro 
115 120 125 

Glu Cys Leu Asp Met Gin Pro Tyr Met Ser Gin Gin Asn Thr Gly Pro 
35 130 135 140 

Leu Val Tyr Val Leu Tyr Ala Val Leu Val His Ala Gly Trp Ser Cys 
145 150 155 160 

40 His Asn Gly His Tyr Phe Ser Tyr Val Lys Ala Gin Glu Gly Gin Trp 
165 170 175 

Tyr Lys Met Asp Asp Ala Glu Val Thr Ala Ser Ser He Thr £er Val 
180 185 190 

45 

Leu Ser Gin Gin Ala Tyr Val Leu Phe Tyr lie Gin Lys Ser Glu Trp 
195 200 205 

Glu Arg His Ser Glu Ser Val Ser Arg Gly Arg Glu Pro Arg Ala Leu 
50 210 215 220 

Gly Ala Glu Asp Thr Asp Arg Arg Ala Thr Gin Gly Glu Leu Lys Arg 
225 230 235 240 

55 Asp His Pro Cys Leu Gin Ala Pro Glu Leu Asp Glu His Leu Val Glu 
245 250 255 

Arg Ala Thr Gin Glu Ser Thr Leu Asp His Trp Lys Phe Leu Gin Glu 
260 265 270 

60 

Gin Asn Lys Thr Lys Pro Glu Phe Asn Val Arg Lys Val Glu Gly Thr 
275 280 285 
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Leu Pro Pro Asp Val Leu Val lie His Gin Ser Lys Tyr Lys Cys Gly 
5 290 295 300 

Met Lys Asn His His Pro Glu Gin Gin Ser Ser Leu Leu Asn Leu Ser 

305 310 315 320 

10 Ser Thr Thr Pro Thr His Gin. Glu Ser Met Asn Thr Gly Thr Leu Ala 
325 330 335 



15 



Ser Leu Arg Gly Arg Ala Arg Arg Ser Lys Gly Lys Asn Lys His Ser 
340 345 350 

Lys Arg Ala Leu Leu Val Cys Gin 
355 360 



20 <210> 33 

<211> 1683 
<212> DNA 
<213> Unknown 



25 <220> 

<223> Description of Unknown Organism: primate 

<220> 
<221> CDS 
30 <222> (1) . . (15901 



<400> 33 

atg gag gac gac tea etc tac ttg gga ggt gag tgg cag ttc aac cac 48 

Met Glu Asp Asp Ser Leu Tyr Leu Gly Gly Glu Trp Gin Phe Asn His 
35 1 5 10 15 



40 



ttt tea aaa c^c aca tct tct egg cca gat gca get ttt get gaa ate 96 
Phe Ser Lys Leu Thr Ser Ser Arg Pro Asp Ala Ala Phe Ala Glu He 
20 25 30 

cag egg ac- tct etc cct gag aag tea cca etc tea tct gag gee cgt 144 
Gin Arg Thr Ser Leu Pro Glu Lys Ser Pro Leu Ser Ser Glu Ala Arg 
35 40 45 



45 gtc gac etc tgt gat gat ttg get cct gtg gca aga cag ctt get ccc 192 
Val Asp Leu Cys Asp Asp Leu Ala Pro Val Ala Arg Gin Leu Ala Pro 
50 55 60 



agg gag aag ctt cct ctg agt age agg aga cct get gcg gtg ggg get 240 
50 Arg Glu Lys Leu Pro Leu Ser Ser Arg Arg Pro Ala Ala Val Gly Ala 
65 70 75 SO 



ggg etc cag aat atg gga aat ace tgc tac gag aac get tec ctg cag 288 
Gly Leu Gin Asn Met Gly Asn Thr Cys Tyr Glu Asn Ala Ser Leu Gin 
55 85 90 95 



60 



tgc ctg aca tac aca ccg ccc ctt gee aac tac atg ctg tec egg gag 336 
Cys Leu Thr Tyr Thr Pro Pro Leu Ala Asn Tyr Met Leu Ser Arg Glu 
100 105 110 

cac tct caa aca tgt cag cgt ccc aag tgc tgc atg etc tgt act atg 384 
His Ser Gin Thr Cys Gin Arg Pro Lys Cys Cys Met Leu Cys Thr Met 
115 120 125 
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caa get cac ate aca tgg gec etc cac agt cct ggt cat gtc ate cag 432 

5 Gin Ala His Zip. Thr Trp Ala Leu His Ser Pro Gly His Val lie Gin 
130 135 140 

ccc tea cag gca ttg get act ggc ttc cat aga ggc aag cag gaa gat 480 
Pro Ser Gin Ala Leu Ala Ala Gly Phe His Arg Gly Lys Gin Glu Asp 
10 145 ISO 155 160 

gec cat gaa ttt etc atg ttc act gtg gat gec atg aaa aag gca tgc 528 
Ala His Glu Phe Leu Met Phe Thr Val Asp Ala Met Lys Lys Ala Cys 
165 170 175 

15 

ctt ccc ggc cac aag cag gta gat cat cac Let aag gac acc acc etc 576 
Leu Pro Gly His Lys Gin Val Asp His His Ser Lys Asp Thr Thr Leu 
180 185 190 

20 ate cac caa ata ttt gga ggc tgc tgg aga tct caa ate aag tgt etc 624 
He His Gin He Phe Gly Gly Cys Trp Arg Ser Gin He Lys Cys Leu 
195 200 205 

cac tgc cac ggg at-.t cca gac act ttt gac cct tac ctg gac ate gec 672 
25 His Cys His Gly He Pro Asp Thr Phe Asp Pro Tyr Leu Asp He Ale 
210 215 223 

ctg gat ate cag gca get cag agt gtc aag caa get ttg gaa cag ttg 720 
Leu Asp He Gin Ala Ala Gin Ser Val Lys Gin Ala Leu Glu Gin Leu 
30 225 230 235 240 

gtg aag ccc gaa gaa etc aat gga gag aat gec taz cat tgt ggt ctt 768 
Val Lys Pro Glu Glu Leu Asn Gly Glu Asn Ala Tyr His Cys Gly Leu 
245 250 255 

35 

tgt etc cag agg gcg ccg gec tec aag acg tta act tta cac act tct 816 
Cys Leu Gin Arg Ala Pro Ala Ser Lys Thr Leu Thr Leu His Thr Ser 
260 265 270 

40 gec aag gtc etc ate ctt gtm ttg aag aga ttc tec gat gtc aca ggc 864 
Ala Lys Val Leu He Leu Xaa Leu Lys Arg Phe Ser Asp Val Thr Gly 
275 260 285 

aac aaa ctt gec aag aat gtg caa tat cct gag tgc ctt gac atg cag 912 
45 Asn Lys Leu Ala Lys Asn Val Gin Tyr Pro Glu Cys Leu Asp Met Gin 
290 295 300 

cca tac atg tct cag cag aac aca gga cct ctt gtc tat gtc etc tat 960 
Pro Tyr Met Ser Gin Gin Asn Thr Gly Pro Leu Val Tyr Val Leu Tyr 
50 305 310 315 320 

get gtg ctg gtc cac get ggg tgg agt tgt cac aac gga cat tac ttc 1008 

Ala Val Leu Val His Ala Gly Trp Ser Cys His Asn Gly His Tyr Phe 
325 330 335 

55 

tct tat gtc aaa get caa gaa ggc cag tgg tat aaa atg gat gat gec 1056 

Ser Tyr Val Lys Ala Gin Glu Gly Gin Trp Tyr Lys Met Asp Asp Ala 
340 345 350 

60 gag gtc acc gec tct age ate act tct gtc ctg agt caa cag gec tac 1104 
Glu Val Thr Ala Ser Ser He Thr Ser Val Leu Ser Gin Gin Ala Tyr 
355 360 365 
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gtc etc ttt tac ate cag aag agt gaa tgg gaa aga cac agt gag agt 
Val r,eu Phe Tyr lie Gin Lys Ser Glu Trp Glu Arg His Ser Glu Ser 
370 375 380 



10 



gtg tea aga ggc agg gaa cca aga gec ctt ggc gca gaa gac aca gac 
Val Ser Arg Gly Arg Glu Pro Arg Ala Leu Gly Ala Glu Asp Thr Asp 
385 390 395 400 



1200 



15 



agg cga gca acg caa gga gag etc aag aga gac cac ccc tgc etc cag 1248 
Arg Arg Ala Thr Gin Gly Glu Leu Lys Arg Asp His Pro Cys Leu Gin 
405 410 415 

gec ccc gag ttg gac gag cac ttg gtg gaa aga gec act cag gaa age 1296 
Ala Pro Glu Leu Asp Glu His Leu Val Glu Arg Ala Thr Gin Glu Ser 
420 425 430 



20 acc tta gac cac tgg aaa ttc ctt caa gag caa aac aaa acg aag cct 
Thr Leu Asp His Trp Lys Phe Leu Gin Glu Gin Asn Lys Thr Lys Pro 
435 440 445 

gag ttc aac gtc aga aaa gec gaa ggt acc ctg cct ccc gac gta ctt 
25 Glu Phe Asn Val Arg Lys Val Glu Gly Thr Leu Pro Pro Asp Val Leu 
450 455 460 



30 



gtg att cat caa tea aaa tac aag tgt ggg atg aag aac cat cat cct 
Val lie His Gin Ser Lys Tyr Lys Cys Gly Met Lys Asn His His Pro 
465 470 475 480 



35 



gaa cag caa age tec ctg eta aac etc tct teg acg acc ccg aca cat 14 88 
Glu Gin Gin Ser Ser Leu Leu Asn Leu Ser Ser Thr Thr Pro Thr His 
485 490 495 

cag gag tec atg aac act ggc aca etc get tec ctg cga ggg agg gec 1536 
Gin Glu Ser Met Asn Thr Gly Thr Leu Ala Ser Leu Arg Gly Arg Ala 
500 505 510 



40 agg aga tec aaa ggg aag aac aaa cac age aag agg get ctg ctt gtg 
Arg Arg Ser Lys Gly Lys Asn Lys His Ser Lys Arg Ala Leu Leu Val 
515 520 525 



1584 



tgc cag tgatctcagt ggangtaccg acccacacgt aggggtgeae acacacacgc 
45 Cys Gin 
530 



1640 



50 



55 



acacacacag acacacacat aactacaccc agaagegege tga 



<210> 34 
<211> 530 
<212> PRT 
<213> Unknown 

<400> 34 

Met Glu Asp Asp Ser Leu Tyr Leu Gly Gly Glu Trp Gin Phe Asn His 
15 10 15 



1683 



60 



Plie Ser Lys Leu Thr Ser Ser Arg Pro Asp Ala Ala Phe Ala Glu He 
20 25 30 



Gin Arg Thr Ser Leu Pro Glu Lys Ser Pro Leu Ser Ser Glu Ala Arg 
35 40 45 
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Val Asp Leu Cys Asp Asp Leu Ala Pro Val Ala Arg Gin Leu Ala Pro 
5 50 55 60 

Arg Glu Lys Leu Pro Leu Ser Ser Arg Arg Pro Ala Ala Val Gly Ala 
65 70 75 80 

10 Gly Leu Gin Asn Me: Gly Asn Thr Cys Tyr Glu Asn Ala Ser Leu Gin 
85 90 95 



15 



30 



45 



60 



Cys Leu Thr Tyr Thr Pro Pro Leu Ala Asn Tyr Met Leu Ser Arg Glu 

100 105 110 

His Ser Glr. Thr Cys Gin Arg Pro Lys Cys Cys Met Leu Cys Thr Met 
115 120 125 



Gin Ala His He Thr Trp Ala Leu His Ser Pro Gly His Val He Gin 
20 130 135 140 

Pro Ser Gin Ala Leu Ala Ala Gly Phe His Arg Gly Lys Gin Glu Asp 
145 150 155 160 

25 Ala His Glu Phe Leu Met Phe Thr Vol Asp Ala Met Lys Lys Ala Cys 
165 170 175 



Leu Pro Gly His Lys Gin Val Asp His Hi s Ser Lys Asp Thr Thr Leu 
180 185 190 

He His Gin He Phe Gly Gly Cys Trp Arg Ser Glii He Lys Cys Leu 
195 200 205 



His Cys His Gly He Pro Asp Thr Phe Asp Pro Tyr Leu Asp He Ala 
35 210 215 220 

Leu Asp He Gin Ala Ala Gin Ser Val Lys Gin Ala Leu Glu Gin Leu 
225 230 235 240 

40 Val Lys Pro Glu Glu Leu Asn Gly Glu Asn Ala Tyr His Cys Gly Leu 
245 250 255 



Cys Leu Gin Arg Ala Pro Ala Ser Lys Thr Leu Thr Leu His Thr Ser 
260 265 270 

Ala Lys Val Leu He Leu Xaa Leu Lys Arg Phe Ser Asp Val Thr Gly 
275 280 285 



Asn Lys Leu Ala Lys Asn Val Gin Tyr Pro Glu Cys Leu Asp Met Gin 
50 290 295 300 

Pro Tyr Met Ser Gin Gin Asn Thr Gly Pro Leu Val Tyr Val Leu Tyr 
305 310 315 320 

55 Ala Val Leu Val His Ala Gly Trp Ser Cys His Asn Gly His Tyr Phe 
325 330 335 



Ser Tyr Val Lys Ala Gin Glu Gly Gin Trp Tyr Lys Met Asp Asp Ala 
340 345 350 

Glu Val Thr Ala Ser Ser He Thr Ser Val Leu Ser Gin Gin Ala Tyr 
355 360 365 
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Val Leu Phe Tyr He Gin Lys Ser Glu Trp Glu Arg His Ser Glu Ser 
5 370 375 360 

Val Ser Arg Gly Arg Glu Pro Arg Ala Leu Gly Ala Glu Asp Thr Asp 
385 390 395 400 

10 Arg Arg Ala Thr Gin Gly Glu Leu Lys Arg Asp His Pro Cys Leu Gin 
405 410 415 



15 



30 



Ala Pro Glu Leu Asp Glu His Leu Val Glu Arg Ala Thr Gin Glu Ser 
420 425 430 

Thr Leu Asp His Trp Lys Phe Leu Gin Glu Gin Asn Lys Thr Lys Pro 
435 440 445 



Glu Phe Asn Val Arg Lys Val Glu Gly Thr Leu Pro Pro Asp Val Leu 
20 450 455 460 

Val He His Gin Ser Lys Tyr Lys Cys Gly Met Lys Asn His His Pro 

465 470 475 480 

25 Glu Gin Gin Ser Ser Leu Leu Asn Leu Ser Ser Thr Thr Pro Thr His 

485 490 495 



Gin Glu Ser Met Asn Thr Gly Thr Leu Ala Ser Leu Arg Gly Arg Ala 
500 505 510 

Arg Arg Ser Lys Gly Lys Asn Lys His Ser Lys Arg Ala Leu Leu Val 
515 520 525 



Cys Gin 
35 530 



<210> 35 
<211> 735 
40 <212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: primate 

45 

<220> 

<221> CDS 

<222> (1)..(735) 

50 <220> 

<221> misc_feature 
<222> (197) 

<223> nucleotide may be A or C 

55 <220> 

<221> misc_feature 

<222> (590) 

<223> nucleotide may be A, C, G, or T 

60 <220> 

<221> misc_£eature 
<222> (664) 

<223> nucleotide may be A, C, G, or T 
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<220> 

5 <221> misc_f eature 
<222> (6S5) 

<223> nucleoside may be A, C, G, or T 
<220> 

10 <221> misc_f eature 
<222> (666) 

<223> nucleotide may be A, C, G, or T 
<220> 

15 <221> miec_f eature 
<222> (708) 

<223> nucleotide may be A, C. G, or T 
<220> 

20 <221> misc_f eature 
<222> (715) 

<223> nucleotide may be A, C, G, or T 
<400> 35 

25 atg get gtg cca agt tgg ate gtc aaa cgc agg eta eta cct tgg tec 48 
Met Ala val Pro Ser Trp lie Val Lys Arg Arg Leu Leu Pro Trp Ser 
15 10 15 

ate aaa ttt ttg gag ggt ate tea gat cac ggc gtg aag tgc tec gtg 96 
30 lie Lys Phe Leu Glu Gly lie Ser Asp His Gly Val Lys Cys Ser Val 
20 25 30 

tgc aag age etc teg gac ace tac gac ccc tac ttg gac gtc gcg ctg 144 
Cys Lys Ser Val Ser Asp Thr Tyr Asp Pro Tyr Leu Asp Val Ala Leu 
35 35 40 45 

gag ate ccg caa get gcg aat att gtg cgt get ctg gaa ctt ttc gtg 192 

Glu lie Arg Gin Ala Ala Asn lis Val Arg Ala Leu Glu Leu Phe Val 
50 55 60 

40 

aaa gca gat gtc ctg agt gga gag aat ccc tac atg tgt get aaa tgc 240 

Lys Ala Asp Val Leu Ser Gly Glu Asn Ala Tyr Me~ Cys Ala Lyc Cys 
65 70 75 80 

45 aag aag aag gtt cca gec age aag cgc ttc ace ate cac aga aca tec 288 
Lys Lys Lys Val Pro Ala Ser Lys Arg Phe Thr lie His Arg Thr Ser 
85 90 95 

aac gtc tta ace ctt tec etc aag cgc ttt gee aac ttc age ggg ggg 336 
50 Asn Val Leu Thr Leu Ser Leu Lys Arg Phe Ala Asn Phe Ser Glv Gly 
100 105 110 

aag ate ace aag gat gta ggc tat ccg gaa ttc etc aac ata cgt ccg 3 84 
Lys He Thr Lys Asp Val Gly Tyr Pro Glu Phe Leu Asn He Arg Pro 
55 115 120 125 

tat atg tec cag aat aat ggt gat cct gtc atg tat gga etc tat get 432 
Tyr Met Ser Gin Asn Asn Gly Asp Pro Val Met Tyr Gly Leu Tyr Ala 
130 135 140 



60 



gtc ctg gtg cac teg ggc tac age tgc cat gec ggg cac tat tac tgc 480 
Val Leu Val His Ser Gly Tyr Ser Cys His Ala Gly His Tyr Tyr Cys 
145 150 155 160 
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tac gtg aag gca age aat gga cag tgg tac cag atg aat gat tec ttg 528 
5 Tyr Val Lys Ala Ser Asn Gly Gin Trp Tyr Gin Met Asn Asp Ser Leu 
16= 170 175 

gtc cca ttc cag caa cgt cca agt tgg ttt ctg aaa cca gca ggc eta 576 
Val Pro Phe Gin Gin Arg Pro Ser Trp Phe Leu Lys Pro Ala Gly Leu 
10 180 185 190 

agt ggc ttg ttc tea teg gcg aat ttc cag get etc aag aaa aat tec 624 

Ser Gly Leu Phe Ser Ser Ala Asn Phe Gin Ala Leu Lys Lys Asn Ser 
1S5 200 205 

15 

cga agg gec tec att ttc cag gaa cag gtt cct tec tec cct tec egg 672 

Arg Arg Ala Ser He Phe Gin Glu Gin Val Pro Ser Ser Pro Ser Arg 
210 215 220 

2 0 gcg gec cga att gtg aat tec aga ttc att ccc age agg aac etc ggc 720 
Ala Ala Arg He Val Asn Ser Arg Phe He Pro Ser Arg Asn Leu Gly 
225 230 235 240 

aat ggg gat tat ttt 73 5 

25 Asn Gly Asp Tyr Phe 
245 



<210> 36 
30 <211> 245 
<212> PRT 
<213> Unknown 

<400> 36 

35 Met Ala Val Pro Ser Trp He Val Lys Arg Arg Leu Leu Pro Trp Ser 
15 10 15 

He Lys Phe Leu Glu Gly lie Ser Asp His Gly Val Lys Cys Ser Val 
20 25 30 

40 

Cys Lys Ser Val Ser Asp Thr Tyr Asp Pro Tyr Leu Asp Val Ala Leu 
35 40 45 

Glu He Arg Gin Ala Ala Asn He Val Arg Ala Leu Glu Leu Phe Val 
45 50 55 60 

Lys Ala Asp Val Leu Ser Gly Glu Asn Ala Tyr Met Cys Ala Lys Cys 
65 70 75 80 

50 Lys Lys Lys Val Pro Ala Ser Lys Arg Phe Thr He His Arg Thr Ser 
85 90 95 

Asn Val Leu Thr Leu Ser Leu Lys Arg Phe Ala Asn Phe Ser Gly Gly 
100 105 110 

55 

Lys He Thr Lys Asp Val Gly Tyr Pro Glu Phe Leu Asn He Arg Pro 
115 120 125 

Tyr Met Ser Gin Asn Asn Gly Asp Pro Val Met Tyr Gly Leu Tyr Ala 
60 130 135 140 

Val Leu Val His Ser Gly Tyr Ser Cys His Ala Gly His Tyr Tyr Cys 
145 150 155 160 
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Tyr Val Lys Ala Ser Asn Gly Gin Trp Tyr Gin Met Asn Asp Ser Leu 
5 165 170 175 

Val Pro Phe Gin Gin Arg Pro Ser Trp Phe Leu Lys Pro Ala Gly Leu 
180 185 190 

10 Ser Gly Leu Phe Ser Ser Ala Asn Phe Gin Ala Leu Lys Lys Asn Ser 
19S 200 205 



15 



Arg Arg Ala Ser lie Phe Gin Glu Gin Val Pro Ser Ser Pro Ser Arg 

210 215 220 

Ala Ala Arg lie Val Asn Ser Arg Phe lie Pro Ser Arg Asn Leu Gly 

225 230 235 240 



Asn Gly Asp Tyr Phe 
20 245 



<210> 37 
<211> 2244 
2 5 <212> DNA 

<213> Unknown 



30 



<220> 

<223> Description of Unknown Organism: primate 

<220> 

<221> CDS 

<222> (1) . . (2244) 

35 <40C> 37 

atg cag aaa gcc tgc ctg aat ggc tgt gcc aag ttg gat cgt caa acg 48 

Met Gin Lys Ala Cys Leu Asn Gly Cys Ala Lys Leu Asp Arg Gin Thr 
15 10 15 

40 cag get act acc ttg gtc cat caa att ttt gga ggg tat etc aga tea 96 
Gin Ala Thr Thr Leu Val His Gin lie Phe Gly Gly Tyr Leu Arg Ser 
20 25 30 

cgc gtg aag tgc tec gtg tgc aag age gtc teg gac acc tac gac ccc 144 
45 Arg Val Lys Cys Ser Val Cys Lys Ser Val Ser Asp Thr Tyr Asp Pro 
35 40 45 

tac ttg gac gtc gcg ctg gag ate egg caa get gcg aat att gtg cgt 192 
Tyr Leu Asp Vol Ala Leu Glu lie Arg Gin Ala Ala Asn lie Val Arg 
50 50 55 60 

get ctg gaa ctt ttt gtg aaa gca gat gtc ctg agt gga gag aat gcc 240 

Ala Leu Glu Leu Phe Val Lys Ala Asp Val Leu Ser Gly Glu Asn Ala 

65 70 75 80 

55 

tac atg tgt get aaa tgc aag aag aag gtt cca gcc age aag cgc ttc 288 

Tyr Met Cys Ala Lys Cys Lys Lys Lys Val Pro Ala Ser Lys Arg Phe 

85 90 95 

60 acc ate cac aga aca tec aac gtc tta acc ctt tec etc aag cgc ttt 336 
Thr lie His Arg Thr Ser Asn Val Leu Thr Leu Ser Leu Lys Arg Phe 
100 105 110 

gcc aac ttc age ggg ggg aag ate acc aag gat gta ggc tat ccg gaa 384 
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Ala Asn Phe Ser Gly Gly Lys lie Thr Lys Asp Val Gly Tyr Pro Glu 
115 120 125 

ttc etc aac ata cgt ccg tat atg tec cag aat aat ggc gat cct gtc 432 
Phe Leu Asn He Arg Pro Tyr Met Ser Gin Asn Asn Gly Asp Pro Val 
130 135 140 



10 



atg tat gga etc tat get gtc ctg gtg cac teg ggc tac age tgc cat 480 
Met Tyr Gly Lieu Tyr Ala Val Leu Val His Ser Gly Tyr Ser Cys His 
145 150 155 160 

15 gec ggg cac tat tac tgc tac gtg aag gca age aat gga cag tgg tac 528 
Ala Gly His Tyr Tyr Cys Tyr Val Lys Ala Ser Asn Gly Gin Trp Tyr 
165 170 175 

cag atg aat gat tec ctg gtc cat tec age aac gtc aag gtg gtt ctg 576 
20 Gin Met Asn Asp Ser Leu Vol His Ser Ser Asn Val Lys Val Val Leu 
180 185 190 

aac cag cag gec tac gtg ctg ttc tat ctg cga att cca ggc tct aag 624 
Asn Gin. Gin Ala Tyr Val Leu Phe Tyr Leu Arg lie Pro Gly Ser Lys 
25 155 200 205 

aaa agt ccc gag ggc etc ate tec agg aca ggc tec tec tec ctt ccc 672 

Lys Ser Pro Glu Gly Leu lie Ser Arg Thr Gly Ser Ser Ser Leu Pro 

210 215 220 

30 

ggc cgc ccg agt gtg att cca gat cac tec aag aag aac ate cgc aat 720 

Gly Arg Pro Ser Val lie Pro Asp His Ser Lys Lys Asn lie Gly Asn 
225 230 235 240 

35 ggg att att tec tec cca ctg act gga aag cga caa gac tct ggg acg 768 
Gly lie lie Ser Ser Pro Leu Thr Gly Lys Arg Gin Asp Ser Gly Thr 
245 250 255 

atg aag aag ccg cac acc act gaa gag att ggt gtg ccc ata tec agg 816 
40 Met Lys Lys Pro His Thr Thr Glu Glu lie Gly Val Pro lie Ser Arg 
260 265 270 

aat ggc tec acc ctg ggc c-g aag tec cag aac ggc tgc att cct cca 864 
Asn Giy Ser Thr Leu Gly Leu Lys Ser Gin Asn Gly Cys lie Pro Pro 
45 275 280 285 

aag ctg ccc teg ggg tec cct tec ccc aaa etc tec cag aca ccc aca 912 

Lys Leu Pro Ser Gly Ser Pro Ser Pro Lys Leu Ser Gin Thr Pro Thr 
290 295 300 

50 

cac atg cca acc ate eta gac gac cct gga aag aag gtg aag aag cca 960 

His Met Pro Thr He Leu Asp Asp Pro Gly Lys Lys Val Lys Lys Pro 
305 310 315 320 

55 get cct cca cag cac ttt tec ccc aga act get cag ggg ctg cct ggg 1008 
Ala Pro Pro Gin His Phe Ser Pro Arg Thr Ala Gin Gly Leu Pro Gly 
325 330 335 

acc age aac teg aat age age aga tct ggg age caa agg cag ggc tec 1056 
60 Thr Ser Asn Ser Asn Ser Ser Arg Ser Gly Ser Gin Arg Gin Gly Ser 
340 345 350 

tgg gac age agg gat gtt gtc etc tct acc tea cct aag etc ctg get 1104 
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Trp Asp Ser Arg Asp Val Val Leu Ser Thr Ser Pro Lys Leu Leu Ala 
355 3SC 365 

aca gcc act gcc aac ggg cat ggg ctg aag ggg aac gac gag age get 1152 
Thr Ala Thr Ala Asn Gly His Gly Leu Lys Gly Asn Asp Glu Ser Ala 
370 375 380 

ggc etc gac agg agg ggc tec age age tec age cca gag cac teg gcc 1200 
Gly Leu Asp Arg Arg Gly Ser Ser Ser Ser Ser Pro Glu His Ser Ala 
385 390 395 400 

15 age age gac tec ace aag gcc ceo cag occ ccc agg agt gga gcg gcc 1248 
Ser Ser Asp Ser Thr Lys Ala Pro Gin Thr Pro Arg Ser Gly Ala Ala 
405 410 415 

cat etc tgc gat tct cag gaa acg aac tgt tec ace get ggc cac tec 1296 
20 His Leu Cys Asp Ser Cln Glu Thr Asn Cys Ser Thr Ala Gly His Ser 
420 425 430 

aaa acg ccg cca agt gga gca gat tct aag acg gtg aag ctg aag tec 1344 
Lys Thr Pro Pro Ser Gly Ala Asp Ser Lys Thr Val Lys Leu Lys Ser 
25 435 440 445 

cct gtc ctg age aac acc ace act gag cct gca age ace atg tct cct 1392 

Pro Val Leu Ser Asn Thr Thr Thr Glu Pro Ala Ser Thr Met Ser Pro 

450 455 460 

30 

cca cca gcc aaa aaa ctg gcc ctt tct gcc aag aag gcc age acc ctg 1440 

Pro Pro Ala Lys Lys Leu Ala Leu Ser Ala Lys Lys Ala Ser Thr Leu 

465 470 475 480 

35 tgg agg gcg acc ggc aat gac etc cgt cca cct ccc ccc tea cca tec 1488 
Trp Arg Ala Thr Gly Asn Asp Leu Arg Pro Pro Pro Pro Ser Pro Ser 
485 490 495 

tec gac etc acc cac ccc atg aaa acc tct cac ccc gtc gtt gcc tec 1536 
40 Ser Asp Leu Thr His Pro Met Lys Thr Ser His Pro Val Val Ala Ser 
500 505 510 

act tgg ccc gtc cat aga gcc agg get gtg tea cct get ccc caa tea 1584 
Thr Trp Pro Val His Arg Ala Arg Ala Val Ser Pro Ala Pro Gin Ser 
45 515 520 525 

tec age cgc ctg caa ccc ccc ttc age ccc cac ccc aca ttg ctg tec 1632 
Ser Ser Arg Leu Gin Pro Pro Phe Ser Pro His Pro Thr Leu Leu Ser 
530 535 540 

50 

agt acc ccc aag ccc cca ggg acg tea gaa cca egg age tgc tec tec 1680 
Ser Thr Pro Lys Pro Pro Gly Thr Ser Glu Pro Arg Ser Cys Ser Ser 
545 550 555 560 

55 ate teg acg gcg ctg cct cag gtc aac gag gac ctt gtg tct ctt cca 1723 
lie Ser Thr Ala Leu Pro Gin Val Asn Glu Asp Leu Val Ser Leu Pro 
565 570 575 

cac cag ttg cca gag gcc agt gag ccc ccc cag age ccc tct gag aag 1776 
60 His Gin Leu Pro Glu Ala Ser Glu Pro Pro Gin Ser Pro Ser Glu Lys 
580 585 590 
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agg aaa aag acc ttt gtg gga gag ccg cag agg ctg ggc tea gag acg 1824 
5 Arg Lys Lys Thr Phe Val Gly Glu Pro Gin Arg Leu Gly Ser Glu Thr 
595 600 605 

cgc etc cca cag cac ate agg gag gec act gcg get ccc cac ggg aag 1872 
Arg Leu Pro Gin His He Arg Glu Ala Thr Ala Ala Pro His Gly Lys 
10 610 615 620 

agg aag agg aag aag aag aag cgc ccg gag gac aca get gec age gec 1920 

Arg Lys Arg Lys Lys Lys Lys Arg Pro Glu Asp Thr Ala Ala Ser Ala 
625 630 635 640 

15 

ctg cag gag ggg cag aca cag aga cag cct ggg age ccc atg cac agg 1968 

Leu Gin Glu Gly Gin Thr Gin Arg Gin Pro Gly Ser Pro Met Tyr Arg 
645 650 655 

20 agg gag ggc cag gca cag ctg ccc get gtc aga ccg cag gaa gat ggc 2016 
Arg Glu Gly Gin Ala Gin Leu Pro Ala Val Arg Arg Gin Glu Asp Gly 
660 665 670 

aca cag cca cag gtg aat ggc cag cag gtg gga tgt gtt acg gac ggc 2064 
25 Thr Gin Pro Gin Val Asn Gly Gin Gin Val Gly Cys Val Thr Asp Gly 
675 630 685 

cac cac gcg age age agg aag egg agg agg aaa gga gca gaa ggt ctt 2112 
His His Ala Scr Ser Arg Lys Arg Arg Arg Lys Gly Ala Glu Gly Leu 
30 690 695 700 

ggt gaa gaa ggc ggc ctg cac cag gac cca ctt egg cac age tgc tct 2160 

Gly Glu Glu Gly Gly Leu Eis Gin Asp Pro Leu Arg His Ser Cys Ser 

705 710 715 720 

35 

ccc atg ggt gat ggt gat cca gag gec atg gaa gag tct cca agg aaa 2208 

Pro Met Gly Asp Gly Asp Pro Glu Ala Met Glu Glu Ser Pro Arg Lys 

725 730 735 

40 aag aaa aaa aaa aaa aac teg agg ggg ggc ccg gta 2244 
Lys Lys Lys Lys Lys Asn Ser Arg Gly Gly Pro Val 
740 745 



45 <21C> 38 

<211> 748 

<212> PRT 

<213> Unknown 

50 <400> 38 

Met Gin Lys Ala Cys Leu Asn Gly Cys Ala Lys Leu Asp Arg Gin Thr 
15 10 15 

Gin Ala Thr Thr Leu Val His Gin lie Phe Gly Gly Tyr Leu Arg Ser 
55 20 25 30 

Arg Val Lys Cys Ser Val Cys Lys Ser Val Ser Asp Thr Tyr Asp Pro 
35 40 45 

60 Tyr Leu Asp Val Ala Leu Glu lie Arg Gin Ala Ala Asn lie Val Arg 
50 55 60 

Ala Leu Glu Leu Phe Val Lys Ala Asp Val Leu Ser Gly Glu Asn Ala 
65 70 75 80 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT/US99/12366 



53 



5 Tyr Met Cys Ala Lys Cys Lys Lys Lys Val Pro Ala Ser Lys Arg Phe 
85 90 95 

Thr lie His Arg Thr Ser Asn Val Leu Thr Leu Ser Leu Lys Arg Phe 
100 105 110 

10 

Ala Asn Phe Ser Gly Gly Lys He Thr Lys Asp Val Gly Tyr Pro Glu 
115 120 125 

Phe Leu Asn He Arg Pro Tyr Met Ser Gin Asn Asn Gly Asp Pro Val 
15 130 135 140 

Met Tyr Gly Leu Tyr Ala Val Leu Val His Ser Gly Tyr Ser Cys His 
145 150 155 16C 

20 Ala Gly His Tyr Tyr Cys Tyr Val Lys Ala Ser Asn Gly Gin Trp Tyr 
165 170 175 

Gin Met Asn Asp Ser r.eu Val His Ser Ser Asn Val Lys Val Val Leu 
180 185 190 

25 

Asn Gin Gin Ala Tyr Val Leu Phe Tyr Leu Arg He Pro Gly Ser Lys 
195 200 205 

Lys Ser Pro Glu Gly Leu He Ser Arg Thr Gly Ser Ser Ser Leu Pro 
30 210 215 220 

Gly Arg Pro Ser Val He Pre Asp His Ser Lys Lys Asn He Gly Asn 
225 230 235 240 

3 5 Gly He He Ser Ser Pro Leu Thr Gly Lys Arg Gin Asp Ser Gly Thr 
245 250 255 

Met Lys Lys Pro His Thr Thr Glu Glu He Gly Val Pro He Ser Arg 
260 265 270 

40 

Asn Gly Ser Thr Leu Gly Leu Lys Ser Gin Asn Gly Cys He Pro Pro 
275 280 285 

Lys Leu Pro Ser Gly Ser Pro Ser Pro Lys Leu Ser Gin Thr Pro Thr 
45 290 295 300 

His Met Pro Thr He Leu Asp Asp Pro Gly Lys Lys Val Lys Lys Pro 
305 310 315 320 

50 Ala Pro Pro Gin His Phe Ser Pro Arg Thr Ala Gin Gly Leu Pro Gly 
325 330 335 

Thr Ser Asn Ser Asn Ser Ser Arg Ser Gly Ser Gin Arg Gin Gly Ser 
340 345 350 

55 

Trp Asp Ser Arg Asp Val Val Leu Ser Thr Ser Pro Lys Leu Leu Ala 
355 360 365 

Thr Ala Thr Ala Asn Gly His Gly Leu Lys Gly Asn Asp Glu Ser Ala 
60 370 375 330 

Gly Leu Asp Arg Arg Gly Ser Ser Ser Ser Ser Pro Glu His Ser Ala 
385 390 395 400 
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5 Ser Ser Asp Ser Thr Lys Ala Pro Gin Thr Pro Arg Ser Gly Ala Ala 
405 410 415 

His Leu Cys Asp Ser Gin Glu Thr Asn Cys Ser Thr Ala Gly His Ser 
420 425 430 

10 

Lys Thr Pro Pro Ser Gly Ala Asp Ser Lys Thr Val Lys Leu Lys Ser 
435 440 445 

Pro Val Leu Ser Asn Thr Thr Thr Glu Pro Ala Ser Thr Met Ser Pro 
15 450 455 460 

Pro Pro Ala Lys Lys Leu Ala Leu Ser Ala Lys Lys Ala Ser Thr Leu 
465 470 475 480 

20 Trp Arg Ala Thr Gly Asn Asp Leu Arg Pro Pro Pro Pru Ser Pro Ser 
485 490 495 

Ser Asp Leu Thr His Pro Met Lys Thr Ser His Pro Val Val Ala Ser 
500 505 510 

25 

Thr Trp Pro Val His Arg Ala Arg Ala Val Ser Pro Ala Pro Gin Ser 
515 520 525 

Ser Ser Arg Leu Gin Pro Pro Phe Ser Pro His Pro Thr Leu Leu Ser 
30 530 535 540 

Ser Thr Pro Lys Pro Pro Gly Thr Ser Glu Pro Arg Ser Cys Ser Ser 
545 S50 555 560 

35 lie Ser Thr Ala Leu Pro Gin Val Asn Glu Asp Leu Val Ser Leu Pro 
565 570 575 

His Gin Leu Pro Glu Ala Ser Glu Pro Pro Gin Ser Pro Ser Glu Lys 
530 585 590 

40 

Arg Lys Lys Thr Phe Val Gly Glu Pro Gin Arg Leu Gly Ser Glu Thr 
595 600 605 

Arg Leu Pro Gin His lie Arg Glu Ala Thr Ala Ala Pro His Gly Lys 
45 610 615 620 

Arg Lys Arg Lys Lys Lys Lys Arg Pro Glu Asp Thr Ala Ala Ser Ala 
625 630 635 640 

50 Leu Gin Glu Gly Gin Thr Gin Arg Gin Pro Gly Ser Pro Met Tyr Arg 
645 650 655 

Arg Glu Gly Gin Ala Gin Leu Pro Ala Val Arg Arg Gin Glu Asp Gly 
660 665 670 

55 

Thr Gin Pro Gin Val Asn Gly Gin Gin Val Gly Cys Val Thr Asp Gly 
675 680 685 

His His Ala Ser Ser Arg Lys Arg Arg Arg Lys Gly Ala Glu Gly Leu 
60 690 695 700 

Gly Glu Glu Gly Gly Leu His Gin Asp Pro Leu Arg His Ser Cys Ser 
705 710 715 " 720 
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Pro Met Gly Asp Gly Asp Pro Glu Ala Met Glu Glu Ser Pro Arg Lys 
5 725 730 735 

Lys Lys Lys Lys Lys Asn Ser Arg Gly Gly Pro Val 
740 745 

10 

<210> 39 
<211> 526 
<212> PRT 
<213> Unknown 

15 

<220> 

<223> Description of Unknown Organism:primate 
<400> 39 

20 Met Val Val Ala Leu Ser Phe Pro Glu Ala Asp Pro Ala Leu Ser Ser 
15 10 15 

Pro Asp Ala Pro Glu Leu His Gin Asp Glu Ala Gin Val Val Glu Glu 
20 25 30 

25 

Leu Thr Val Asn Gly Lys His Ser Leu Ser Trp Glu Ser Pro Gin Gly 
35 40 45 

Pro Gly Cys Gly Leu Gin Agn Thr Gly Asn Ser Cys Tyr Leu Asn Ala 
30 50 55 60 

Ala Leu Gin Cys Leu Thr His Thr Pro Pro Leu Ala Asp Tyr Met Leu 
65 70 75 80 

35 Ser Gin Glu His Ser Gin Thr Cys Cys Ser Pro Glu Gly Cys Lys Leu 
85 90 95 

Cys Ala Met Glu Ala Leu Val Thr Gin Ser Leu Leu His Ser His Ser 
100 105 110 

40 

Gly Asp Val Met Lys Pro Ser His lie Leu Thr Ser Ala Phe His Lys 
115 120 125 

His Gin Gin Glu Asp Ala His Glu Phe Leu Met Pile Thr Leu Glu Thr 
45 130 135 140 

Met His Glu Ser Cys Leu Gin Val His Arg Gin Ser Lys Pro Thr Ser 
145 150 155 160 

50 Glu Asp Ser Ser Pro lie His Asp lie Phe Gly Gly Trp Trp Arg Ser 
165 170 175 

Gin lie Lys Cys Leu Leu Cys Gin Gly Thr Ser Asp Thr Tyr Asp Arg 
130 185 190 

55 

Phe Leu Asp lie Pro Leu Asp lie Ser Ser Ala Gin Ser Val Lys Gin 
195 200 205 

Ala Leu Tro Asp Thr Glu Lys Ser Glu Glu Leu Cys Gly Asp Asn Ala 
60 210 215 220 

Tyr Tyr Cys Gly Lys Cys Arg Gin Lys Met Pro Ala Ser Lys Thr Leu 
225 230 233 240 
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His Val His He Ala Pro Lys Val Leu Met Val Val Leu Asn Arg Phe 
5 245 250 255 

Ser Ala Phe Thr Gly Asn Lys Leu Asp Arg Lys Val Ser Tyr Pro Glu 
260 265 270 

10 Phe Leu Asp Leu Lys Pro Tyr Lou Ser Glu Pro Thr Gly Gly Pro Leu 
275 280 285 

Pro Tyr Ala Leu Tyr Ala Val Leu Val His Asp Gly Ala Thr Ser His 
290 295 300 

15 

Ser Gly His Tyr Phe Cys Cys Val Lys Ala Gly His Gly Lys Trp Tyr 
305 310 315 320 

Lys Met Asp Asp Thr Lys Val Thr Arg Cys Asp Val Thr Ser Val Leu 
20 325 330 335 

Asn Glu Asn Ala Tyr Val Leu Phe Tyr Val Gin Gin Ala Asn Leu Lys 
340 345 350 

25 Gin Val Ser He Asp Met Pro Glu Gly Arg lie Asn Glu Val Leu Asp 
355 360 365 

Pro Glu Tyr Gin Leu Lys Lys Ser Arg Arg Lys Lys His Lys Lys Lys 
370 375 380 

30 

Ser Pro Phe Thr Glu Asp Leu Gly Glu Pro Cys Glu Asn Arg Asp Lys 
385 390 395 400 

Arg Ala He Lys Glu Thr Ser Leu Gly Lys Gly Lys Val Leu Gin Glu 
35 405 410 415 

Val Asn His Lys Lys Ala Gly Gin Lys His Gly Asn Thr Lys Leu Met 
420 425 430 

40 Pro Gin Lys Gin Asn His Gin Lys Ala Gly Gin Asn Leu Arg Asn Thr 
435 440 445 

Glu Val Glu Leu Asp Leu Pro Ala Asp Ala He Val He His Gin Pro 
450 455 460 

45 

Arg Ser Thr Ala Asn Trp Gly Arg Asp Ser Pro Asp Lys Glu Asn Gin 
465 470 475 480 

Pro Leu His Asn Ala Asp Arg Leu Leu Thr Ser Gin Gly Pro Val Asn 
50 485 490 495 

Thr Trp Gin Leu Cys Arg Gin Glu Gly Arg Arg Arg Ser Lys Lys Gly 
500 505 510 

55 Gin Asn Lys Asn Lys Gin Gly Gin Arg Leu Leu Leu Val Cys 
515 520 525 



<210> 40 

60 <211> 545 

<212> PRT 

<213> Unknown 



SUBSTITUTE SHEET ( rule 26 ) 



WO 00/01817 



PCT7US99/12366 



57 

<223> Description of Unknown Organism: primate 

5 

<400> 4C 

Met Val Val Ser Leu Ser Phe Pro Glu Ala Asp Pro Ala Leu Ser Ser 
15 10 15 

10 Pro Cly Ala Gin Gin Leu His Gin Asp Glu Ala Gin Val Val Val Glu 
20 25 30 

Leu Thr Ala Asn Asp Lys Pro Ser Leu Ser Trp Glu Cys Pro Gin Gly 
35 40 45 

15 

Pro Gly Cys Gly Leu Gin Asn Thr Gly Asn Ser Cys Tyr Leu Asn Ala 
50 55 60 

Ala Leu Gin Cys Leu Thr His Thr Pro Pro Leu Ala Asp Tyr Met Leu 
20 65 70 75 80 

Ser Gin Glu Tyr Ser Gin Thr Cys Cys Ser Pro Glu Gly Cys Lys Met 
85 90 95 

25 Cys Ala Met Glu Ala His Val Thr Gin Ser Leu Leu His Ser His Ser 
100 105 110 

Gly Asp Val Met Lys Pro Ser Gin lie Leu Thr Scr Ala Phe His Lys 
115 120 125 

30 

His Gin Gin Glu Asp Ala His Glu Phe Leu Met Phe Thr Leu Glu Thr 
130 135 140 

Met His Glu Ser Cys Leu Gin Val His Arg Gin Ser Glu Pro Thr Ser 
35 145 150 155 160 

Glu Asp Ser Ser Pro lie His Asp lie Phe Gly Gly Leu Trp Arg Ser 
165 170 175 

40 Gin lie Lys Cys Leu His Cys Gin Gly Thr Ser Asp Thr Tyr Asp Arg 
180 185 190 

Phs Leu Asp Val Pro Leu Asp lie Ser Ser Ala Gin Ser Val Asn Gin 
195 200 205 

45 

Ala Leu Trp Asp Thr Glu Lys Ser Glu Glu Leu Arg Gly Glu Asn Ala 
210 215 220 

Tyr Tyr Cys Gly Arg Cys Arg Gin Lys Met Pro Ala Ser Lys Thr Leu 
50 225 230 235 240 

His lie His Ser Ala Pro Lys Val Leu Leu Leu Val Leu Lys Arg Phe 
245 250 255 

55 Ser Ala Phe Met Gly Asn Lys Leu Asp Arg Lys Val Ser Tyr Pro Glu 
260 265 270 

Phe Leu Asp Leu Lys Pro Tyr Leu Ser Gin Pro Thr Gly Gly Pro Leu 
275 280 285 

60 

Pro Tyr Ala Leu Tyr Ala Val Leu Val His Glu Gly Ala Thr Cys His 
290 295 300 
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Ser Gly His Tyr Phe Ser Tyr Val Lys Ala Arg His Gly Ala Trp Tyr 
5 305 310 315 320 

Lys Mec Asp Asp Thr Lys Val Thr Ser Cys Asp Val Thr Ser Val Leu 
325 330 335 

10 Asn Glu Asr. Ala Tyr Val Leu Phe Tyr Val Gin Gin Thr Asp Leu Lys 
340 345 350 

Gin Val Ser He Asp Met Pro Glu Gly Arg Val His Glu Val Leu Asp 
355 360 365 

15 

Pro Glu Tyr Gin Leu Lys Lys Ser Arg Arg Lys Lys His Lys Lys Lys 
370 375 380 

Ser Pro Cys Thr Glu Asp Ala Gly Glu Pro Cys Lys Asn Arg Glu Lys 
20 385 390 395 400 

Arg Ala Thr Lys Glu Thr Ser Leu Gly Glu Gly Lys Val Xaa Gin Glu 
405 410 415 

25 Lys Asn His Lys Lys Ala Gly Gin Lys His Glu Asn Thr Lys Leu Val 
420 425 430 

Pro Gin Glu Gin Asn His Gin Lys Leu Gly Gin Lys His Arg lie Asn 
435 440 445 

30 

Glu He Leu Pro Gin Glu Gin Asn His Gin Lys Ala Gly Gin Ser Leu 
450 455 460 

Axg Asn Thr Glu Gly Glu Leu Asp Leu Pro Ala Asp Ala He Val He 
35 465 470 475 480 

His Leu Leu Arg Ser Thr Glu Asn Trp Gly Arg Asp Ala Pro Asp Lys 
485 490 495 

40 Glu Asn Gin Pro Trp His Asn Ala Asp Arg Leu Leu Thr Ser Gin Asp 
500 505 510 

Pro Val Asn Thr Gly Gin Leu Cys Arg Gin Glu Gly Arg Arg Arg Ser 
515 520 525 

45 

Lys Lys Gly Lys Asn Lys Asn Lys Gin Gly Gin Arg Leu Leu Leu Val 
530 535 540 

Cys 
50 545 



<210> 41 

<211> 890 

55 <212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism:primate 

60 

<22C> 

<221> CDS 

<222> (15).. (500) 
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<220> 

5 <221> misc_f eature 
<222> (53) 

<223> nucleotide may be A or C 
<220> 

10 <221> misc_feature 
<222> (123> 

<223> nucleotide may be C or G 
<220> 

15 <221> misc_£eature 
<222> (124) 

<223> nucleotide may be G or T 
<220> 

20 -:221^ misc_£eature 
<222> <125) 

<223> nucleotide may be C or T 
<220> 

25 <221> misc_f eature 
<222> (525) 

<223> nucleotide may be A, C, G, or T 
<220> 

30 <221> misc_feature 
<222> (547) 

<223> nucleotide may be A, C. G, or T 
<220> 

35 <221> misc_feature 
<222> (835) 

<223> nucleotide may be A, C, G, or T 
<400> 41 

40 ggcacgagcc cacc atg aag ggt ttc aca gcc act etc ttc etc tgg act 50 
Met Lys Gly Phe Thr Ala Thr Leu Phe Leu Trp Thr 
15 10 

etc att ttt ccc age tgc agt gga ggc ggc ggt ggg aaa gcc tgg ccc 9 8 
45 Leu lie Phe Pro Ser Cys Ser Gly Gly Gly Gly Gly Lys Ala Trp Pro 
15 20 25 

aca cac gtg gtc tgt age gac age cgc ttg gaa gtg etc tac cag agt 146 
Thr His Val Val Cys Ser Asp Ser Arg Leu Glu Val Leu Tyr Gin Ser 
50 30 35 40 

tgc gat cca tta caa gat ttt ggc ttt tct gtt gaa aag tgt tec aag 194 
Cys Asp Pro Leu Gin Asp Phe Gly Phe Ser Val Glu Lys Cys Ser Lys 
45 50 55 60 



55 



caa tta aaa tea aat ate aac att aga ttt gga att att ctg aga gag 242 
Gin Leu Lys Ser Asn lie Asn lie Arg Phe Gly lie He Leu Arg Glu 
65 70 75 



60 gac ate aaa gag ctt ttt ctt gac eta get etc atg tct caa ggc tea 290 
Asp lie Lys Glu Leu Phe Leu Asp Leu Ala Leu Met Ser Gin Gly Ser 
90 85 90 
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tct gtt ttg aat ttc tec Cat ccc ate tgt gag gcg get ctg ccc aag 338 
5 Ser Val Leu Asn Phe Ser Tyr Pro lie Cys Glu Ala Ala Leu Pro Lys 
95 100 105 

ttt tct ttc tgt gga aga agg aaa gga gag cag att tac tat get ggg 386 
Phe Ser Phe Cys Gly Arg Arg Lys Gly Glu Gin lie Tyr Tyr Ala Gly 
10 110 115 120 

cct gtc aat aa-. cct gaa ttt act att cct cag gga gaa tac cag gtt 434 

Pro Val Asn Asn Pro Glu Phe Thr lie Pro Gin Gly Glu Tyr Gin Val 

125 130 135 140 

15 

ttg ctg gaa ctg toe act gaa aaa egg tec acc gtg gec tgt gee aat 482 

Leu Leu Glu Leu Tyr Thr Glu Lys Arg Ser Thr Val Ala Cys Ala Asn 

145 150 .155 

20 get act ate atg tgc tec tgactgtggg cctgttagca aaaactcaca 530 
Ala Thr lie Met Cys Ser 
160 



25 



gccagctgca tetegteggg aaccttccaa gctcctctga ctgaacccac tgtgggagga 590 
gaagcagctg atgacagaga gaggctctac aaagaagege ccccaaagag tgeagctget 650 
aattttagtc ccaggaccag acatccccag actccaeaga tgtaatgaag tccccgaatg 710 
30 tatctgtttc taaggagect cttggcagcc cttaagcagt cttgagggtc catccttttt 770 
ctctaattgg tcgcctccca ccagactcac ctgettttea actttttagg agtgcttcct 830 
cacacgttac caataataaa gaaagctggc caccaaaaaa ananaoaaaa aaaaaaaaaa 890 

35 

<210> 42 
<211> 162 
<212> PRT 
40 <213> Unknown 

<400> 42 

Met Lys Gly Phe Thr Ala Thr Leu Phe Leu Trp Thx Leu lie Phe Pro 
1 5 10 15 



45 



60 



Ser Cys Ser Gly Gly Gly Gly Gly Lys Ala Trp Pro Thr His Val Val 
20 25 30 



Cys Ser Asp Ser Arg Leu Glu Val Leu Tyr Gin Ser Cys Asp Pro Leu 

50 35 40 45 

Gin Asp Phe Gly Phe Ser Val Glu Lys Cys Ser Lys Gin Leu Lys Ser 

50 55 60 

55 Asn lie Asn He Arg Phe Gly He He Leu Arg Glu Asp He Lys Glu 

65 70 75 80 



Leu Phe Leu Asp Leu Ala Leu Met Ser Gin Gly Ser Ser Val Leu Asn 
85 90 95 

Phe Ser Tyr Pro He Cys Glu Ala Ala Leu Pro Lys Phe Ser Phe Cys 
100 105 110 
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Gly Arg Arg Lys Gly Glu Gin lie Tyr Tyr Ala Gly Pro Val Asn Asn 
5 115 120 125 

Pro Glu Phe Thr lie Pro Gin Gly Glu Tyr Gin Val Leu Leu Glu Leu 
130 135 140 

10 Tyr Thr Glu Lys Arg Ser Thr Val Ala Cys Ala Asn Ala Thr lie Met 
145 150 155 160 



15 



45 



Cys Ser 



<210> 43 

<211> 486 

<212> DMA 

20 <213> Unknown 

<220> 

<223> Description of Unknown Organism: primate 

25 <220> 

<221> CDS 

<222> (1) . . (132) 

<400> 43 

30 ccc ctg ttt tct tec ata ttt act gaa get cag aag cag tat tgg gtc 48 

Pro Leu Plae Ser Ser lie Phe Thr Glu Ala Gin Lys Gin Tyr Trp Val 

15 10 15 

tgc aac tea tec gat gca agt att tea tac acc tac tgt gat aaa atg 96 
35 Cys Asn Ser Ser Asp Ala Ser lie Ser Tyr Thr Tyr Cys Asp Lys Met 
20 25 30 

caa tac cca att tea att aat gtt aac ccc tgt ata gaattgaaag 142 
Gin Tyr Pro He Ser He Asn Val Asn Pro Cys He 
40 35 40 

gatccaaagg attattgeae attttctaca ttccaaggag agatttaaag caattatatt 202 

tcaatctcta tataactgtc aacaccatga atcttccaaa gcgcaaagaa gttatttgee 262 

gaggatctga tgacgattac tetttttgea gagctctgaa gggagagact gtgaatacaa 322 

caatatcatt ctccttcaag ggaataaaat tttctaaggg aaaatacaaa tgcgttgttg 382 

50 aagctatttc tgggagecca gaagaaatgc tettttgett ggagtttgtc atcctacacc 442 

aacctaattc aaattagaat aaattgagta tttaaaaaaa aaaa 486 

55 <210> 44 
<211> 44 
<212> PRT 
<213> Unknown 

60 <400> 44 

Pro Leu Phe Ser Ser He Phe Thr Glu Ala Gin Lys Gin Tyr Trp Val 
15 10 15 
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15 



55 



62 

Cys Asn Ser Ser Asp Ala Ser lie Ser Tyr Thr Tyr Cys Asp Lys Met 
20 25 30 

Gin Tyr Pro lie Ser He Asn Val Asn Pro Cys He 
35 40 



<210> 4= 
<211> 483 
<212> DKA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism:primate 



<220> 
20 <221> CDS 

<222> (1)..(480) 

<400> 45 

atg ttc cca ttt ctg ttt ttt tec acc ctg ttt tct tec aca ttt act 48 

25 Met Phe Pro Phe Leu Phe Phe Ser Thr Leu Phe Ser Ser He Phe Thr 
15 10 15 

gaa get cag aag cag tat tgg gtc tgc aac tea tec gat gca agt att 96 
Glu Ala Gin Lys Gin Tyr Trp Val Cys Asn Ser Ser Asp Ala Ser He 
30 20 25 30 

tea tac acc tac tgt gat aaa atg caa tac cca att tea att aat gtt 144 

Ser Tyr Thr Tyr Cys Asp Lys Met Gin Tyr Pro He Ser He Asn Val 
35 40 45 

35 

aac ccc tgt ata gaa ttg aaa gga tec aaa gga tta ttg cac att ttc 192 

Asn Pro Cys He Glu Leu Lys Gly Ser Lys Gly Leu Leu His He Phe 
50 55 60 

40 tac att cca agg aga gat tta aag caa tta tat ttc aat etc tat ata 240 
Tyr He Pro Arg Arg Asp Leu Lys Gin Leu Tyr Phe Asn Leu Tyr He 
65 70 75 80 

act gtc aac acc atg aat ctt cca aag cgc aaa gaa gtt att tgc cga 288 
45 Thr Val Asn Thr Met Asn Leu Pro Lys Arg Lys Glu Val He Cys Arg 
85 90 95 

gga tct gat gac gat tac tct ttt tgc aga get ctg aag gga gag act 336 
Gly Ser Asp Asp Asp Tyr Ser Phe Cys Arg Ala Leu Lys Gly Glu Thr 
50 100 105 110 

gtg aat aca aca ata tea ttc tec ttc aag gga ata aaa ttt tct aag 384 
Val Asn Thr Thr He Ser Phe Ser Phe Lys Gly He Lys Phe Ser Lys 
115 120 125 



gga aaa tac aaa tgt gtt gtt gaa get att tct ggg age cca gaa gaa 432 
Gly Lys Tyr Lys Cys Val Val Glu Ala He Ser Gly Ser Pro Glu Glu 
130 135 140 



60 atg etc ttt tgc ttg gag ttt gtc ate eta cac caa cct aat tea aat 480 

Met Leu Phe Cys Leu Glu Phe Val He Leu His Gin Pro Asn Ser Asn 

145 150 155 160 

tag 483 
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5 <210> 46 
<211> 160 
<212> PRT 
<213> Unknown 

10 <400> 46 

Met Phe Pro Phe Leu Phe 
1 5 

Glu Ala Gin Lys Gin Tyr 
15 20 

Ser Tyr Thr Tyr Cys Asp 
35 

20 Asn Pro Cys lie Glu Leu 
50 

Tyr lie Pro Arg Arg Asp 
65 70 

25 

Thr Val Asn Thr Met Asn 
85 

Gly Ser Asp Asp Asp Tyr 
30 100 

Val Asn Thr Thr lie Ser 
115 

35 Gly Lys Tyr Lys Cys val 

130 

Met Leu Phe Cys Leu Glu 
145 150 

40 

<210> 47 
<211> 498 
<212> DNA 
45 <213> Unknown 

<220> 

<223> Description of Unknown Organism: rodent 

50 <220> 

<221> CDS 

<222> (53) . . (394) 

<400> 47 

55 gtcgagtccg atggtcttcc tggcgagttt aaagtatcgg agatattaaa tc atg ttg 58 

Met Leu 
1 

cca ttt att etc ttt teg acg ctg ctt tct ccc ata ttg act gaa tct 106 
60 Pro Phe lie Leu Phe Ser Thr Leu Leu Ser Pro lie Leu Thr Glu Ser 
5 10 15 

gag aag caa cag tgg ttc tgc aac tec tec gat gca att att tec tac 154 



Phe Ser Thr Leu Phe 
10 

Trp Val Cys Asn Ser 
25 

Lys Met Gin Tyr Pro 
•40 



Lys Gly Ser Lys Gly 
55 

Leu Lys Gin Leu Tyr 
75 

Leu Pro Lys Arg Lys 
90 

Ser Phe Cys Arg Ala 
105 

Phe Ser Phe Lys Gly 
120 

Val Glu Ala lie Ser 
135 

Phe Val lie Leu His 
155 



Ser Ser He Phe Thr 
15 

Ser Asp Ala Ser He 
30 

He Ser He Asn Val 
45 



Leu Leu His He Phe 
60 

Phe Asn Leu Tyr He 
80 

Glu Val He Cys Arg 
95 

Leu Lys Gly Glu Thr 
110 

He Lys Phe Ser Lys 
125 

Gly Ser Pro Glu Glu 

140 

Gin Pre Asn Ser Asn 
160 
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64 

Glu Lys Gin Gin Trp Phe Cys Asn Ser Ser Asp Ala lie He Ser Tyx 
20 25 30 

agt tat tgt gat cac ttg aaa tec cct att tea att agt tct gaa ccc 202 
Ser Tyr Cys Asp His Leu Lys Phe Pro He Ser He Ser Ser Glu Pro 
35 40 45 50 

tgc ata aga ctg agg gga acc aat gga ttt gtg cat gtt gag ttc att 250 
Cys He Arg Leu Arg Gly Thr Asn Gly Phe Val His Val Glu Phe He 
55 60 65 

15 cca aga gga aac tta aaa tat tta tat ttc aac eta ttc ate agt gtc 298 
Pro Arg Gly Asn Leu Lys Tyr Leu Tyr Phe Asn Leu Phe He Ser Val 
70 75 80 

aac tec ata gag ttg ccg aag cgt aag gaa gtt ctg tgc cat gga cat 346 
20 Asn Ser He Glu Leu Pro Ly3 Arg Lys Glu Val Leu Cys His Gly His 
35 90 95 

gat gat gac tat tct ttt tgc aga get ctg aaa gga gga tat get att 394 
Asp Asp Asp Tyr Ser Phe Cys Arg Ala Leu Lys Gly Gly Tyr Ala He 
25 100 105 110 

tagaaaatat gagactgtga atacatcaat accattctct ttcgagggaa tactatttcc 454 

taagggecat tacagatgtg ttgcagaaoc tattgctggg gata 498 

30 

<210> 48 
<211> 114 
<212> PRT 
35 <213> Unknown 

<400> 48 

Met Leu Pro Phe He Leu Phe Ser Thr Leu Leu Ser Pro He Leu Thr 
15 10 15 



40 



55 



60 



Glu Ser Glu Lys Gin Gin Trp Phe Cys Asn Ser Ser Asp Ala ~le He 
20 25 30 



Ser Tyr Ser Tyr Cys Asp Kis Leu Lys Phe Pro He Ser He Ser Ser 

45 35 40 45 

Glu Pro Cys He Arg Leu Arg Gly Thr Asn Gly Phe Val His Val Glu 

50 55 60 

50 Phe He Pro Arg Gly Asn Leu Lys Tyr Leu Tyr Phe Asn Leu Phe He 

65 70 75 80 



Ser Val Asn Ser He Glu Leu Pro Lys Arg Lys Glu Val Leu Cys His 
85 90 95 

Gly His Asp Asp Asp Tyr Ser Phe Cys Arg Ala Leu Lys Gly Gly Tyr 
100 105 110 

Ala He 



<210> 49 
<211> 141 
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<212> PRT 
5 <213> Unknown 

<220> 

<223> Description of Unknown Organism: primate 
10 <400> 49 

Met Leu Pro Phe lie Leu Phe Ser Thr Leu Leu Ser Pro lie Leu Thr 
15 10 15 

Glu Ser Glu Lys Gin Gin Trp Phe Cys Asn Ser Ser Asp Ala lie lie 
15 20 25 30 

Ser Tyr Ser Tyr Cys Asp His Leu Lys Phe Pro lie Ser lie Ser Ser 
35 40 45 

20 Glu Pro Cys lie Arc Leu Arg Gly Thr Asn Gly Phe Val His Val Glu 
50 " 55 60 

Phe lie Pro Arg Gly Asn Leu Lys Tyr Leu Tyr Phe Asn Leu Phe lie 
65 70 "75 80 

25 

Ser Val Asn Ser lie Glu Leu Pro Lys Arg Lys Glu Val Leu Cys His 
85 90 95 

Gly His Asp Asp Asp Tyr Ser Phe Cys Arg Ala Leu Lys Gly Glu Thr 
30 100 105 110 

Val Asn Thr Ser lie Pro Phe Ser Phe Glu Gly lie Leu Phe Pro Lys 
115 120 125 

35 Gly His Tyr Arg Cys Val Ala Glu Ala lie Ala Gly Asp 
130 135 140 



<210> 50 
40 <211> 162 
<212> PRT 
<213> Unknown 

<220> 

45 <223> Description of Unknown Organism: rodent 
<400> 50 

Met Asn Gly Val Ala Ala Ala Leu Leu Val Trp He Leu Thr Ser Pro 
15 10 IS 

50 

Ser Ser Ser Asp His Gly Ser Glu Asn Gly Trp Pro Lys His Thr Ala 
20 25 30 

Cys Asn Ser Gly Gly Leu Glu Val Val Tyr Gin Ser Cys Asp Pro Leu 
55 35 40 45 

Gin Asp Phe Gly Leu Ser He Asp Gin Cys Ser Lys Gin He Gin Ser 
50 55 60 

SO Asn Leu Asn lie Arg Phe Gly He He Leu Arg Gin Asp He Arg Lys 
65 70 75 80 

Leu Phe Leu Asp He Thr Leu Met Ala Lys Gly Ser Ser He Leu Asn 
85 90 95 
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5 Tyr Ser Tyr Pro Leu Cys Glu 
100 



Gly Arg Arg Lys Gly Glu Gin 
115 

10 

Pro Gly Leu Asp Val Pro Gin 
130 135 

Tyr Asn Glu Asn Arg Ala Thr 
15 145 150 

Ser Ser 



66 



Glu Asp Gin Pro Lys Phe Ser Phe Cys 
105 110 

lie Tyr Tyr Ala Gly Pro Val Asn Asn 
120 125 



Gly Glu Tyr Gin Leu Leu Leu Glu Leu 
140 



Val Ala Cys Ala Asn Ala Thr Val Thr 
155 160 



25 



20 

<210> 51 
<211> 1158 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: avian 

<220> 
30 <221> CDS 

<222> (11) . . (490) 

<400> 51 

cggtgcaacc atg aag aca ctg aat get etc get etc gtc tta gtc ctg 49 

35 Met Lys Thr Leu Asn Val Leu Ala Leu Val Leu Val Leu 

15 10 

ctt tgc ate aat gec age aca gag tgg cct aca cac aca gtc tgc aag 97 
Leu Cys lie Asn Ala Ser Thr Glu Trp Pro Thr His Thr Val Cys Lys 
40 15 20 25 

gag gaa aac ttg gag ata tat tac aaa age tgt gat ccc cag cad gac 145 

Glu Glu Asn Leu Glu lie Tyr Tyr Lys Ser Cys Asp Pro Gin Gin Asp 

30 35 40 45 

45 

ttt get ttc age att gac cgt tgt tea gat gtc aca ace cac acc ttt 193 

Phe Ala Phe Ser lie Asp Arg Cys Ser Asp Val Thr Thr His Thr Phe 
50 55 60 

50 gac ate aga get gca atg gtc eta aga caa age ate aag gaa ctg tat 241 
Asp lie Arg Ala Ala Met Val Leu Arg Gin Ser lie Lys Glu Leu Tyr 
65 70 75 

gee aag gtt gat ctg ate ata aat ggg aag act gtc tta age tac tea 289 
55 Ala Lys Val Asp Leu lie lie Asn Gly Lys Thr Val Leu Ser Tyr Ser 
80 85 90 

gag aca etc tgt gga cea ggc ctt tct aag eta att ttc tgt gga aag 337 
Glu Thr Leu Cys Gly Pro Gly Leu Ser Lys Leu lie Phe Cys Gly Lys 
60 95 100 105 

aag aaa gga gaa cat etc tac tat gag gga cca ate aca ctg gga ate 385 
Lys Lys Gly Glu His Leu Tyr Tyr Glu Gly Pro lie Thr Leu Gly lie 
110 115 120 125 
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5 aaa gaa ate cca cag cga gat tac act otc aca gca agg ctg act aac 433 
Lys Glu He Pro Gin Arg Asp Tyr Thr He Thr Ala Arg Leu Thr Asn 
130 135 140 

gaa gat cgc gec act gtt get tgt get gat ttt acc gtg aaa aat tat 481 
10 Glu Asp Arg Ala Thr Val Ala Cys Ala Asp Phe Thr Val Lys Asn Tyr 
145 150 155 

tta gat tat taagcaaaac aacgcactcg gtccgactcc cttaaaacta 530 
Leu Asp Tyr 
15 160 



20 



25 



30 



35 



40 <210> 52 
<211> 160 
<212> PRT 
<213> Unknown 

45 <40O> 52 

Met Lys Thr Leu Asn VaL Leu Ala Leu Val Leu Val Leu Leu Cys He 
15 10 15 

Asn Ala Ser Thr Glu Trp Pro Thr His Thr Val Cys Lys Glu Glu Asn 
50 20 25 30 

Leu Glu He Tyr Tyr Lys Ser Cys Asp Pro Gin Gin Asp Phe Ala Phe 
35 40 45 

55 Ser He Asp Arg Cys Ser Asp Val Thr Thr His Thr Phe Asp He Arg 
50 55 60 



60 



cagattccta 


aaactattca 


agcccagtga 


getgettgea 


tgcttcagtg 


attctgaagg 


590 


aaagatctcc 


cgcacggtgg 


ttctgatget 


gttcctcttc 


gtaattcaac 


ttttttggag 


650 


aagtcactag 


gccctaccct 


ctagtggtaa 


ttttatctcc 


aaatgeaetc 


tgtagcccac 


710 


ttttcgcttt 


taatatatac 


agetgeaaat 


agaaagtatt 


tgataccaac 


attctcatct 


770 


caggatgaaa 


atagtacaaa 


gcagaagagg 


egagagecaa 


aacagatttt 


tgeagtaage 


830 


tatggaggta 


tccatttcta 


acacaagcta 


aagaagattg 


tcatatgtat 


tatgeagtta 


890 


tagcac tcaa 


cattttcagt 


ttttcacaag 


gcctgtttgg 


agcctccatt 


ggtataaatt 


950 


ttgttgtaac 


cacagaacaa 


agaccaaata 


ggatgaacat 


ggctccatgt 


tcagtcactc 


1010 


tattcatatc 


atttaagttt 


tcatgattc: 


tcttgtatat 


ttttttttat 


tctttaatgt 


1070 


ttacagtgat 


gtgagaatcc 


ttt-gtttaa 


getacatget 


gttcccgctt 


gtcaataaat 


1130 


ctgcaagaaa 


aaaaaaaaaa 


aaaaaaaa 








11S8 



Ala Ala Met Val Leu Arg Gin Ser He Lys Glu Leu Tyr Ala Lys Val 
65 70 75 80 

Asp Leu He He Asn Gly Lys Thr Val Leu Ser Tyr Ser Glu Thr Leu 
85 90 95 
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Cys Gly Pro Gly Leu Ser Lys Leu He Phe Cys Gly Lys Ly3 Lya Gly 
5 100 105 110 

Glu His Leu Tyr Tyr Glu Gly Pro He Thr Leu Gly He Lys Glu He 
115 120 125 

10 Pro Gin Arg Asp Tyr Thr He Thr Ala Arg Leu Thr Asn Glu Asp Arg 
130 135 140 



IB 



Ala Thr Val Ala Cys Ala Asp Phe Thr Val Lys Asn Tyr Leu Asp Tyr 
145 150 155 160 



<210> 53 
<211> 2684 
<212> DNA 
20 <213> Unknown 

<220> 

<223> Description of Unknown Organism:primate 

25 <220* 

<221> CDS 

<222> (45).. (1256) 

<400> 53 

30 gaattcggca cgagggcgag gttttataca cctgaaagaa gaga atg tea aga cga 56 

Met Ser Arg Arg 

1 

agt age cgt tta cm get oog cag cag ccc cag ccc age cag acg gaa 104 
35 Ser Ser Arg Leu Gin Ala Lys Gin Gin Pro Gin Pro Ser Gin Thr Glu 
5 10 15 20 

tec ccc caa gaa gee cag ata ate cag gee aag aag agg aaa act acc 152 
Ser Pro Gin Glu Ala Gin He He Gin Ala Lys Lys Arg Lys Thr Thr 
40 25 30 35 

cag gat gtc aaa aaa aga aga gag gag gtc acc aag aaa cat cag tat 200 
Gin Asp Val Lys Lys Arg Arg Glu Glu Val Thr Lys Lys His Gin Tyr 
40 45 50 

45 

gaa att agg aat tgt tgg cca cct gta tta tct ggg ggg ate agt cct 248 
Glu He Arg Asn Cys Trp Pro Pro Val Leu Ser Gly Gly He Ser Pro 
55 60 65 

50 tgc att ate att gaa aca cct cac aaa gaa ata gga aca agt gat ttc 296 
Cys He He lie Glu Thr Pro His Lys Glu He Gly Thr Ser Asp Phe 
70 75 80 

tec aga ttt aca aat tac aga ttt aaa aat ctt ttt att aat cct tea 344 
55 Ser Arg Phe Thr Asn Tyr Arg Phe Lys Asn Leu Phe He Asn Pro Ser 
85 90 95 100 

cct ttg cct gat tta age tgg gga tgt tea aaa gaa gtc tgg eta aac 392 
Pro Leu Pro Asp Leu Ser Trp Gly Cys Ser Lys Glu Val Trp Leu Asn 
60 105 110 115 

atg tta aaa aag gag age aga tat gtt cat gac aaa cat ttt gaa gtt 440 
Met Leu Lys Lys Glu Ser Arg Tyr Val His Asp Lys His Phe Glu Val 
120 125 130 
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ctg cat tct gac ttg gaa cca cag atg agg tec ata ctt eta gac tgg 488' 
5 Leu His Ssr Asp Leu Glu Pro Gin Met Arg Ser lie Leu Leu Asp Trp 
135 140 145 

ctt tta gag gta tgt gaa gta tac aca ctt cat agg gaa aca ttt tat 536 
Leu Leu Glu Val Cys Glu Val Tyr Thr Leu His Arg Glu Thr Phe Tyr 
10 150 155 160 

ctt gca caa gac ttt ttt gat aga ttt atg ttg aca caa aag gat ata 584 

Leu Ala Gin Asp Phe Phe Asp Arg Phe Met Leu Thr Gin Lys Asp lie 

165 170 175 180 

15 

aat aaa aat atg ctt caa etc att gga att acc ten tta ttc at; get 632 

Asn Lys Asn Met Leu Gin Leu lie Gly lie Thr Ser Leu Phe lie Ala 

185 190 195 

20 tec aaa ctt gag gaa ate tat get cct aaa etc caa gag ttt get tac 680 
Ser Lys Leu Glu Glu He Tyr Ala Pro Lys Leu Gin Glu Phe Ala Tyr 
200 205 210 

gtc act gat ggt get tgc agt gaa gaa gat ate tta agg atg gaa etc 728 
25 Val Thr Asp Gly Ala Cys Ser Glu Glu Asp lie Leu Arg Met Glu Leu 
215 220 225 

att ata tta aag get tta aaa tgg gaa ctt tgt cct gta aca ate ate 776 
lie lie Leu Lys Ala Leu Lys Trp Glu Leu Cys Pro Val Thr He He 
30 230 235 240 

tec tgg eta aat etc ttt etc caa gtt gat get ctt aaa gat get cct 824 

Ser Trp Leu Asn Leu Phe Leu Gin Val Asp Ala Leu Lys Asp Ala Pro 
245 250 255 260 

35 

aaa gtt ctt eta cct cag tat tct cag gaa aca ttc att caa ata get 872 

Lys Val Leu Leu Pro Gin Tyr Ser Gin Glu Thr Phe He Gin He Ala 
265 270 275 

40 cag ctt tta gat ctg tgt a^t eta gee att gat tea tta gag ttc cag 920 
Gin Leu Leu Asp Leu Cys He Leu Ala He Asp Ser Leu Glu Phe Gin 
280 285 290 

toe aga ata ctg act get get gec ttg tgc cat ttt acc tec att gaa 968 
45 Tyr Arg He Leu Thr Ala Ala Ala Leu Cys His Phe Thr Ser He Glu 
295 300 305 

gtg gtt aag aaa gec tea ggt ttg gag tgg gac agt att tea gaa tgt 1016 
Val Val Lys Lys Ala Ser Gly Leu Glu Trp Asp Ser He Ser Glu Cys 
50 310 315 320 

gta gat tgg atg gta cct ttt gtc aat gta gta aaa agt act agt cca 1064 

Val Asp Trp Met Val Pro Phe Val Asn Val Val Lys Ser Thr Ser Pro 

325 330 335 340 

55 

gtg aag Ctg aag act ttt aag aag att cct atg gaa gac aga cat aat 1112 

Val Lys Leu Lys Thr Phe Lys Lys He Pro Met Glu Asp Arg His Asn 

345 350 355 

60 ate cag aca cat aca aac tat ttg get atg ctg gag gaa gta aat tac 1160 
He Gin Thr His Thr Asn Tyr Leu Ala Met Leu Glu Glu Val Asn Tyr 
360 365 370 
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ata aac acc ttc aga aaa ggg gga cag ttg tea cca gtg tgc aat gga 1208 

5 lie Asn Thr Phe Arg Lys Gly Gly Gin Leu Ser Pro Val Cys Asn Gly 
375 380 385 

ggc att atg aca cca ccg aag age act gaa aaa cca cca gga aaa cac 1256 

Gly He Met Thr Pro Pro Lys Ser Thr Glu Lys Pro Pro Gly Lys His 
10 390 395 400 



15 



taaagaagat aactaagcaa acaagttgga attcaccaag attgggtaga actggtatca 1316 
ctgaactact aaagttttac agaaagtagt gctgtgattg attgecctag ccaattcaca 1376 
agttacaccg ccattctgat tttaaaaccc acaactggca ctaaagaaca catttaatta 1436 
tttcctatgt tagctgttaa agaaacagca ggacttgttt acaaagatgt cttcattccc 1496 
20 anggttactg gatagaagee aaccacagtc tataccatag caacgttttt cctttaatcc 1556 
agtgttactg tgtttatctt gataaactag gaattttgtc actggagttt tggactggat 1616 
aagtgctacc ttaaagggta tactaagtga tacagtactt tgaatctagt tgttagattc 1676 
tcaaaattcc tacactcttg actagtgcaa tttggttctt gaaaattaaa tttaaacttg 1736 
tttacaaagg tttagttttg taataaggtg actaatttat etatagctge tatagcaagc 1796 
tattataaaa cttgaatttc tacaaatggt gaaatttaat gttttttaaa ctagtttatt 1856 
tgccttgcca taacacattt tttaactaat aaggcttaga tgaacatggt gttcaacctg 1916 
tgctctaaac agtgggagta ccaaagaaat tataaacaag ataaatgetg tggctccttc 1976 
ctaactgggg ctttcttgac atgtaggttg cttggtaata acctttttgt atatcacaat 2036 
ttgggtgaaa aacctaagta ccctttcaaa ctatttatat gaggaagtca ctttactact 2096 
40 ctaagatacc cctaaggaat tttttttttt aatttagtgt gactaaggct ttatttatgt 2156 
ttgtgaaact gttaaggtcc tttctaaatt cctccattgt gagataagga cagtgtcaaa 2216 
gtgataaagc ttaacacttg acctaaactt ctattttctt aaggaagaag agtattaaat 2276 

45 

atatactgac tcctagaaat ctatttatta aaaaaagaca tgaaaacttg ctgtacatag 2336 
gctagctatt tctaaatatt ttaaattagc ttttctaaaa aaaaaatcca gectcataaa 2396 
50 gtagattaga aaactagatt gctagtttat tttgttatca gatatgtgaa tctcttctcc 2456 
ctttgaagaa actatacatt tattgttacg gtatgaagtc ttctgtatag tttgttttta 2516 
aactaatatt tgtttcagta ttttgtctga aaagaaaaca ccactaattg tgtacatatg 2576 
tattatataa acttaacctt ttaatactgt ttatttttag cccatgttta aaaaataaaa 2636 
gttaaaaaea tttaactget aaaaaaaaaa aaaaaaaagt gcggccgc 2684 



25 



30 



35 



55 



60 



<210> 54 
<211> 404 
<212> PRT 
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<213> Unknown 

5 

<400> 54 

Met Ser Arg Arg Ser Ser Arg Leu Gin Ala Lys Gin Gin Pro Gin Pro 
15 10 15 

10 Ser Gin Thr Glu Ser Pro Gin Glu Ala Gin lie He Gin Ala Lys Lys 
20 25 30 

Arg Lys Thr Thr Gin Asp Val Lys Lys Arg Arg Glu Glu Val Thr Lys 
35 40 45 

15 

Lys His Gin Tyr Glu He Arg Asn Cys Trp Pro Pro Val Leu Ser Gly 
50 55 60 

Gly He Ser Pro Cys He He He Glu Thr Pro His Lys Glu He Gly 
20 65 70 75 80 

Thr Ser Asp Phe Ser Arg Phe Thr Asn Tyr Arg Phe Lys Asn Leu Phe 
85 90 95 

25 He Asn Pro Ser Pro Leu Pro Asp Leu Ser Trp Gly Cys Ser Lys Glu 
100 105 110 

Val Trp Leu Asn Met Leu Lys Lys Glu Ser Arg Tyr Val His Asp Lys 
115 120 125 

30 

His Phe Glu Val Leu His Ser Asp Leu Glu Pro Gin Met Arg Ser He 
130 135 140 

Leu Leu Asp Trp Leu Leu Glu Val Cys Glu Val Tyr Thr Leu His Arg ' 
35 145 150 155 160 

Glu Thr Phe Tyr Leu Ala Gin Asp Phe Phe Asp Arg Phe Met Leu Thr 
165 170 175 

40 Gin Lys Asp He Asn Lys Asn Met Leu Gin Leu He Gly He Thr Ser 
180 165 190 

Leu Phe He Ala Ser Lys Leu Glu Glu He Tyr Ala Pro Lys Leu Gin 
195 200 205 

45 

Glu Phe Ala Tyr Val Thr Asp Gly Ala Cys Ser Glu Glu Asp lie Leu 
210 215 220 

Arg Met Glu Leu He He Leu Lys Ala Leu Lys Trp Glu Leu Cys Pro 
50 225 230 235 240 

Val Thr He He Ser Trp Leu Asn Leu Phe Leu Gin Val Asp Ala Leu 
245 250 255 

55 Lys Asp Ala Pro Lys Val Leu Leu Pro Gin Tyr Ser Gin Glu Thr Phe 
260 265 270 

He Gin He Ala Gin Leu Leu Asp Leu Cys He Leu Ala He Asp Ser 
275 280 285 

60 

Leu Glu Phe Gin Tyr Arg He Leu Thr Ala Ala Ala Leu Cys His Phe 
290 295 300 
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Thr Ser lie Glu Val 
5 305 

lie Ser Glu Cys Val 
325 

10 Ser Thr Ser Pro Val 
340 

Asp Arg His Asn lie 
355 

15 

Glu Val Asn Tyr lie Asn Thr Phe Arg Lys Gly Gly Gin Leu Ser Pro 
370 375 380 

Val Cys A3n Gly Gly lie Met Thr Pro Pro Lys Ser Thr Glu Lys Pro 
20 385 390 395 400 

Pro Gly Lys His 



Val Lys Lys Ala Ser Gly Leu Glu Trp Asp Ser 
310 315 320 

Asp Trp Met Val Pro Phe Val Asn Val Val Lys 
330 335 

Lys Leu Lys Thr Phe Lys Lys lie Pro Met Glu 
345 350 

Gin Thr His Thr Asn Tyr Leu Ala Met Leu Glu 
360 365 



30 



25 

<210> 55 

<211> 1764 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organismrprimate 

<220> 
35 <221> CDS 

<222> (26).. (1210) 

<400> 55 

ccgggatgcg aaggagcggg acacc atg aag gag gac ggc ggc gcg gag ttc 52 
40 Met Lys Glu Asp Gly Gly Ala Glu Phe 

1 5 

teg get cgc tec agg aag agg aag gca aac gtg acc gtt ttt ttg cag 100 

Ser Ala Arg Ser Arg Lys Arg Lys Ala Asn Val Thr Val Phe Leu Gin 
45 10 15 20 25 

gat cca gat gaa gaa atg gec aaa ate gac agg acg gcg agg gac cag 148 
Asp Pro Asp Glu Glu Met Ala Lys lie Asp Arg Thr Ala Arg Asp Gin 
30 35 40 



50 

tgt ggg age cag cct tgg gac aat aat gca gtc tgt gca gac ccc tgc 196 

Cys Gly Ser Gin Pro Trp Asp Asn Asn Ala Val Cys Ala Asp Pro Cys 

45 50 55 

55 tec ctg ate ccc aca cct gac aaa gaa gat gat gac egg gtt tac cca 244 

Ser Leu lie Pro Thr Pro Asp Lys Glu Asp Asp Asp Arg Val Tyr Pro 

60 65 70 

aac tea acg tgc aag cct egg att att gca cca tec aga ggc tec ccg 292 

60 Asn Ser Thr Cys Lys Pro Arg lie lie Ala Pro Ser Arg Gly Ser Pro 

75 80 85 

ctg cct gta ctg age tgg gca aat aga gag gaa gtc tgg aaa ate atg 340 
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Leu Pro Val Leu Ser Trp Ala Asn Arg Glu Glu Val Trp Lys lie Met 
90 95 100 105 

tta aac aag gaa aag aca tac tta agg gat cag cac ttt ctt gag caa 388 
Leu Asn Lys Glu Lys Thr Tyr Leu Arg Asp Gin His Phe Leu Glu Gin 
110 115 120 

cac cct ctt ctg cag cca aaa atg cga gca att ctt ctg gat tgg tta 436 
His Pro Leu Leu Gin Pro Lys Met Arg Ala lie Leu Leu Asp Trp Leu 
125 130 135 

15 atg gag gtg tgt gaa gtc tat aaa ctt cac agg gag acc ttt tac ttg 484 
Met Glu Val Cys Glu Val Tyr Lys Leu His Arg Glu Thr Phe Tyr Leu 
140 145 150 

gca caa gat ttc ttt gac egg tat atg gcg aca caa gaa aat gtt gta 532 
2 0 Ala Gin Asp Phe Phe Asp Arg Tyr Met Ala Thr Gin Glu Asn Val Val 
155 160 165 

aaa act ctt tta cag ctt att ggg att tea tct tta ttt att gca gec 580 
Lya Thr Leu Leu Gin Leu lie Gly lie Ser Ser Leu Phe lie Ala Ala 
25 170 175 180 185 

aaa ctt gag gaa ate tat cct cca aag ttg cac cac ttt gcg tat gtg S28 

Lys Leu Glu Glu lie Tyr Pro Pro Lys Leu His Gin Phe Ala Tyr Val 

190 195 200 

30 

aca gat gga get tgt tea gga gat gaa att etc acc atg gaa tta atg 676 

Thr Asp Gly Ala Cys Ser Gly Asp Glu lie Leu Thr Met Glu Leu Met 
205 210 215 

35 att atg aag gec ctt aag tgg cgt tta agt ccc ctg act att gtg tec 724 
lie Met Lys Ala Leu Lys Trp Arg Leu Ser Pro Leu Thr lie Val Ser 
220 225 230 

tgg ctg aat gta tac atg cag gtt gca tat eta aat gac tta cat gaa 772 
40 Trp Leu Asn Val Tyr Met Gin Val Ala Tyr Leu Asn Asp Leu His Glu 
235 240 245 

gtg eta ctg ccg cag tat ccc cag caa ate ttt ata cag att gca gag 820 
Val Leu Leu Pro Gin Tyr Pro Gin Gin lie Phe lie Gin He Ala Glu 
45 250 255 260 265 

ctg ttg gat etc tgt gtc ctg gat gtt gac tgc ctt gaa ttt cct tat 868 

Leu Leu Asp Leu Cys Val Leu Asp Val Asp Cys Leu Glu Phe Pro Tyr 
270 275 280 

50 

ggt ata ctt get get teg gec ttg tat cat ttc teg tea tct gaa ttg 916 

Gly lie Leu Ala Ala Ser Ala Leu Tyr His Phe Ser Ser Ser Glu Leu 

285 290 295 

55 atg caa aag gtt tea ggg tat cag tgg tgc gac ata gag aac tgt gtc 964 
Met Gin Lys Val Ser Gly Tyr Gin Trp Cys Asp He Glu Asn Cys Val 
300 305 310 

aag tgg atg gtt cca ttt gec atg gtt ata agg gag acg ggg age tea 1012 
60 Lys Trp Met Val Pro Phe Ala Met Val He Arg Glu Thr Gly Ser Ser 
315 320 325 

aaa ctg aag cac ttc agg ggc gtc get gat gaa gat gca cac aac ata 1060 
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Lys Leu Lys His Phe Arg Gly Val Ala Asp Glu Asp Ala His Asn lie 
330 335 340 345 

cag acc cac aga gac age ttg gat ttg ctg gac aaa gec cga gca aag 1108 
Gin Thr His Arg Asp Ser Leu Asp Leu Leu Asp Lys Ala Arg Ala Lys 
350 355 3 60 

aaa gec atg ttg tct gaa caa aat agg get tct cct ccc ccc agt ggg 1156 
Lys Ala Met Leu Ser Glu Gin Asn Arg Ala Ser Pro Leu Pro Ser Gly 



365 370 375 

15 etc etc acc ccg cca cag age ggt aag aag cag age age ggg ccg gaa 

Leu Leu Thr Pro Pro Gin Ser Gly Lys Lys Gin Ser Ser Gly Pro Glu 
380 385 390 



1204 



atg gcg tgaccacccc atccttctec accaaagaca gttgegcege tgctccacgt 1260 
20 Met Ala 



25 



395 














tctcttctgt 


ctgttgcagc 


ggaggcgtgc 


gtttgctttt 


acagatatct 


gaatggaaga 


1320 


gtgtttcttc 


cacaacagaa 


gtatttctct 


ggatggcatc 


aaacagggca 


aagtgttttt 


1380 


tattgaatgc 


ttataggttt 


tttttaaata 


agtgggtcaa 


gtacaccagc 


cacc tccaga 


1440 


caccagtgcg 


tgctcccgat 


getgetatgg 


aaggtgctnc 


ttgacctaag 


ggactcccac 


1500 


aacaacaaaa 


gcttgaagct 


gtggaggcgc 


acggtggcgt 


ggctctcctc 


gcaggtgttc 


1560 


tgggctccgt 


tgtaccaagt 


ggagcaggtg 


gttgegggea 


agcgttgtgc 


agageccata 


1620 


gccagctggg 


cagggggctg 


ccctctccac 


attatcagtt 


gacagtgtac 


aatgcctttg 


1680 


atgaactgtt 


ttgtaagtgc 


tgetatatet 


atccattttt 


taataaagct 


aatactgttt 


1740 


ctttagagca 


cactggcggg 


tegt 








1764 



30 



35 



40 

<210> 56 
<211> 395 
<212> PRT 
45 <213> Unknown 

<400> 56 

Met Lys Glu Asp Gly Gly Ala Glu Phe Ser Ala Arg Ser Arg Lys Arg 
15 10 15 

Lys Ala Asn Val Thr Val Phe Leu Gin Asp Pro Asp Glu Glu Met Ala 
20 25 30 



Lys lie Asp Arg Thr Ala Arg Asp Gin Cys Gly Ser Gin Pro Trp Asp 
55 35 40 45 

Asn Asn Ala Val Cys Ala Asp Pro Cys Ser Leu lie Pro Thr Pro Asp 
50 55 60 

60 Lys Glu Asp Asp Asp Arg Val Tyr Pro Asn Ser Thr Cys Lys Pro Arg 
65 70 75 80 

lie He Ala Pro Ser Arg Gly Ser Pro Leu Pro Val Leu Ser Trp Ala 
85 90 95 
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Asn Arg Glu Glu Val Trp Lys lie Met Leu Asn Lys Glu Lys Thr Tyr 
5 100 105 110 

Leu Arg Asp Gin His Phe Leu Glu Gin His Pro Leu Leu Gin Pro Lys 
115 120 125 

10 Met Arg Ala He Leu Leu Asp Trp Leu Met Glu Val Cys Glu Val Tyr 
130 135 140 

Lys Leu His Arg Glu Thr Phe Tyr Leu Ala Gin Asp Phe Phe Asp Arg 
145 150 155 160 

15 

Tyr Met Ala Thr Gin Glu Asn Val Val Lys Thr Leu Leu Gin Leu He 
165 170 175 

Gly He Ser Ser Leu Phe lie Ala Ala Lys Leu Glu Glu He Tyr Pro 
20 180 185 190 

Pro Lys Leu His Gin Phe Ala Tyr Val Thr Asp Gly Ala Cys Ser Gly 
195 200 205 

25 Asp Glu He Leu Thr Met Glu Leu Met He Met Lys Ala Leu Lys Trp 
210 215 220 

Arg Leu Ser Pro Leu Thr He Val Ser Trp Leu Asn Val Tyr Met Gin 
225 230 235 240 

30 

Val Ala Tyr Leu Asn Asp Leu His Glu Val Leu Leu Pro Gin Tyr Pro 
245 250 255 

Gin Gin He Phe lie Gin He Ala Glu Leu Leu Asp Leu Cys Val Leu 
35 2S0 265 270 

Asp Val Asp Cys Leu Glu Phe Pro Tyr Gly He Leu Ala Ala Ser Ala 
275 280 285 

40 Leu Tyr His Phe Ser Ser Ser Glu Leu Met Gin Lys Val Ser Gly Tyr 
290 295 300 

Gin Trp Cys Asp He Glu Asn Cys Val Lys Trp Met Val Pro Phe Ala 
305 310 315 320 

45 

Met Val He Arg Glu Thr Gly Ser Ser Lys Leu Lys His Phe Arg Gly 
325 330 335 

Val Ala Asp Glu Asp Ala His Asn lie Gin Thr His Arg Asp Ser Leu 
50 340 345 350 

Asp Leu Leu Asp Lys Ala Arg Ala Lys Lys Ala Met Leu Ser Glu Gin 
355 3S0 365 

55 Asn Arg Ala Ser Pro Leu Pro Ser Gly Leu Leu Thr Pro Pro Gin Ser 
370 375 380 

Gly Lys Lys Gin Ser Ser Gly Pro Glu Met Ala 
385 390 395 

60 
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